THE ERGODIC THEORY OF HYPERBOLIC GROUPS 
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O" 

■ Abstract. These notes are a self-contained introduction to the use of dy- 
namical and probabilistic methods in the study of hyperbolic groups. Most of 
this material is standard; however some of the proofs given are new, and some 
results are proved in greater generality than have appeared in the literature. 
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These are notes from a minicourse given at a workshop in Melbourne July 11-15 
2011. There is little pretension to originality; the main novelty is firstly that we 
give a new (and much shorter) proof of Coornaert's theorem on Patterson-Sullivan 
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measures for hyperbolic groups (Theorem 2.5.4), and secondly that we explain how 
to combine the results of Calegari-Fujiwara in [8] with that of Pollicott-Sharp [35] 
to prove central limit theorems for quite general classes of functions on hyperbolic 
groups (Corollary 3.7.5 and Theorem 3.7.6), crucially without the hypothesis that 
the Markov graph encoding an automatic structure is ergodic. 
A final section on random walks is much more cursory. 



2.1. Coarse geometry. The fundamental idea in geometric group theory is to 
study groups as automorphisms of geometric spaces, and as a special case, to study 
the group itself (with its canonical self-action) as a geometric space. This is accom- 
plished most directly by means of the Cayley graph construction. 

Definition 2.1.1 (Cayley graph). Let G be a group and S a (usually finite) gen- 
erating set. Associated to G and S we can form the Cayley graph Cs{G). This is 
a graph with vertex set G, and with an edge from g to gs for all g G G and s £ S. 

The action of G on itself by (left) multiplication induces a properly discontinuous 
action of G on Cs{G) by simplicial automorphisms. 

If G has no 2-torsion, the action is free and properly discontinuous, and the 
quotient is a wedge of l^l circles Xs- In this case, if G has a presentation G = 
{S I R) we can think of Cs{G) as the covering space of Xs corresponding to the 
subgroup of the free group Fs normally generated by R, and the action of G on 
Cs{G) is the deck group of the covering. 



We assume the reader is familiar with the notion of a metric space, i.e. a space 
X together with a symmetric non-negative real-valued function dx on. X y, X 
which vanishes precisely on the diagonal, and which satisfies the triangle inequality 



2. Hyperbolic groups 






Figure 1 . The Cayley graph of F2 = (a, 6 | ) with generating set 
S = {a,h} 
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dx (x, y) + dx (y, z) > dx {x, z) for each triple x,y, z & X . A metric space is a path 
metric space if for each x,y ^ X, the distance dx{x,y) is equal to the infimum of 
the set of numbers L for which there is a 1-Lipschitz map 7 : [0, L] — > X sending 
to X and L to y. It is a geodesic metric space if it is a path metric space and if 
the infimum is achieved on some 7 for each pair x, y; such a 7 is called a geodesic. 
Finally, a metric space is proper if closed metric balls of bounded radius are compact 
(equivalently, for each point x the function d{x, •) : X — M is proper). 

The graph Cs{G) can be canonically equipped with the structure of a geodesic 
metric space. This is accomplished by making each edge isometric to the Euclidean 
unit interval. If S is finite, Cs (G) is proper. Note that G itself inherits a subspace 
metric from Cs{G), called the word metric. We denote the word metric by ds, and 
define l^ls (or just \g\ if S is understood) to be ds{id,g). Observe that ds{g, h) = 
\9~^h\s = \h~^g\s and that \g\s is the length of the shortest word in elements of S 
and their inverses representing the element g. 

The most serious shortcoming of this construction is its dependence on the choice 
of a generating set S. Different choices of generating set S give rise to different 
spaces Cs{G) which are typically not even homeomorphic. The standard way to 
resolve this issue is to coarsen the geometric category in which one works. 

Definition 2.1.2. Let X. dx and Y^dy be metric spaces. A map f : X ^ Y 
{not assumed to be continuous) is a quasi-isometric map if there are constants 
> 1, e > so that 

K~'^dx{xi,X2) - e < dY{f{xi),f{x2)) < Kdx{xi,X2) + e 

for all xi,X2 € X. It is said to be a quasi-isometry if further f{X) is a net in Y; 
that is, if there is some R so that Y is equal to the ii-neighborhood of f{X). 

One also uses the terminology K, e quasi-isometric map or K, e quasi-isometry if 
the constants are specified. Note that a K,0 quasi-isometric map is the same thing 
as a K bilipschitz map. The best constant K is called the multiplicative constant, 
and the best e the additive constant of the map. 

We denote the i?- neighborhood of a set S by NjiCE,). Hence a quasi-isometry is 
a quasi-isometric map for which Y = Nji{f{X)) for some R. 

Remark 2.1.3. It is much more common to use the terminology quasi-isometric 
embedding instead of quasi-isometric map as above; we consider this terminology 

misleading, and therefore avoid it. 

Lemma 2.1.4. Quasi-isometry is an equivalence relation. 

Proof. Reflexivity and transitivity are obvious, so we must show symmetry. For 
each y gY choose x G X with dyiy, f{x)) < R (such an x exists by definition) and 

define g{y) = x. Observe dyiy, fg{y)) ^ ^ by definition. Then 

dx{g{yi),g{y2)) < Kdy{fg{y^),fg{y2))+Ke < Kdy{yuy2) + K{e + 2R) 
Similarly, 

dx{g{yi).g{y2)) > K-^dy{fg{y^), 75(2/2)) - K-\ > K-^dy{yuy2) - K-\e + 2R) 
proving symmetry. □ 

Note that the compositions fg and gf as above move points a bounded distance. 
One can define a category in which objects are equivalence classes of metric spaces 
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under the equivalence relation generated by thickening (i.e. isometric inclusion as 
a net in a bigger space), and morphisms are equivalence classes of quasi-isometric 
maps, where two maps are equivalent if their values on each point are a uniformly 
bounded distance apart. In this category, quasi-isometries are isomorphisms. In 
particular, the set of quasi-isometries of a metric space X, modulo maps that move 
points a bounded distance, is a group, denoted QI(X), which only depends on the 
quasi-isometry type of X. Determining QI(X), even for very simple spaces, is 
typically extraordinarily difficult. 

Example 2.1.5. A metric space X,dx is quasi-isometric to a point if and only if it 
has bounded diameter. A Cayley graph Cs{G) (for S finite) is quasi-isometric to a 
point if and only if G is finite. 

Example 2.1.6. US and T are two finite generating sets for a group G then the 
identity map from G to itself is a quasi-isomctry (in fact, a bilipschitz map) of G, ds 
to G,dT- For, there are constants Ci and C2 so that c?t(s) < Ci for all s £ S, and 
ds{t) < C2 for all t€T, and therefore C2^dT{g, h) < ds{g, h) < Cidrig, h). 

Because of this, the quasi-isometry class of G, ds is independent of the choice of 
finite generating set, and we can speak unambiguously of the quasi-isometry class 
of G. 

The Schwarz Lemma connects the geometry of groups to the geometry of spaces 
they act on. 

Lemma 2.1.7 (Schwarz Lemma). Let G act properly dis continuously and cocom- 

pactly by isometrics on a proper geodesic m,etric space X . Then G is finitely gen- 
erated by some set S, and the orbit map G ^ X sending g to gx (for any x € X) 
is a quasi-isometry from G, ds to X. 

Proof. Since X is proper and G acts cocompactly there is an R so that GNn{x) = 
X. Note that Gx is a net, since every point of X is contained in some translate gB 
and is therefore within distance Roi gx. 

Let B = N2R+i{x). Since G acts properly discontinuously, there are only finitely 
many g in G for which gB fl S is nonempty; let S be the nontrivial elements of this 
set. 

Now, if g,h G G are arbitrary, let 7 be a geodesic in X from gx to hx. Pa- 
rameterize 7 by arclcngth, and for each integer i G (0, |7|) let gi be such that 
dx{giX,j{i)) < R. Then g^ g^+i e S and therefore 

dsig, h) = \g-^h\ < |7| + 1 = d{gx, hx) + 1 

which shows incidentally that S generates G. 

Conversely, if L := ds{g,h) and gi is a sequence of elements with go = g and 
gL = h and each g^^gi+i e S, then there is a path 7^ from giX to gi+ix of length 
at most AR + 2, and the concatenation of these paths certifies that 

d{gx,hx) < iAR + 2)\g-'^h\ = {AR+2)ds{g,h) 

This completes the proof of the lemma. □ 

Example 2.1.8. If G is a group and is a subgroup of finite index, then G and H are 

quasi-isometric (for, both act properly discontinuously and cocompactly on Cs{G)). 
Two groups are said to be commensurable if they have isomorphic subgroups of finite 
index; the same argument shows that commensurable groups are quasi-isometric. 
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Example 2.1.9. Any two regular trees of (finite) valence > 3 are quasi-isometric; 
for, any such tree admits a cocompact action by a free group of finite rank, and 
any two free groups of finite rank are commensurable. 

Example 2.1.10. The set of ends of a geodesic metric space is a quasi-isometry 
invariant. A famous theorem of Stallings [39] says that a finitely generated group 
with more; than one end splits over a finite subgroup; it follows that the property 
of splitting over a finite subgroup is a quasi-isometry invariant. 

Finiteness of the edge groups (in a splitting) is detected quasi-isometrically by 
the existence of separating compact subsets. Quasi-isometry can further detect 
the finiteness of the vertex groups, and in particular one observes that a group is 
quasi-isometric to a free group if and only if it is virtually free. 

Example 2.1.11. Any two groups that act cocompactly and properly discontinu- 

ously on the same space X are quasi-isomctric. For example, if Mi, M2 arc closed 
Riemannian manifolds with isometric universal covers, then 7ri(Mi) and ni{M2) are 
quasi- isometric. It is easy to produce examples for which the groups in question 
are not commensurable; for instance, a pair of closed hyperbolic 3-manifolds Mi, 
M2 with different invariant trace fields (see [27]). 

Remark 2.1.12. In the geometric group theory literature. Lemma 2.1.7 is often 
called the "Milnor-Svarc (or Svarc-Milnor) Lemma"; "Svarc" here is in fact the 
well-known mathematical physicist Albert Schwarz; it is our view that the orthog- 
raphy "Svarc" tends to obscure this. Actually, the content of this Lemma was first 
observed by Schwarz in the early 50's and only rediscovered 15 years later by Milnor 
at a time when the work of Soviet mathematicians was not widely disseminated in 
the west. 

2.2. Hyperbolic spaces. In a geodesic metric space a geodesic triangle is just a 
union of three geodesies joining three points in pairs. If the three points are x, y, z 
we typically denote the (oriented) geodesies by xy, yz and zx respectively; this 
notation obscures the possibility that the geodesies in question are not uniquely 
determined by their endpoints. 

Definition 2.2.1. A geodesic metric space X, dx is 5 -hyperbolic if for any geodesic 
triangle, each side of the triangle is contained in the (5-neighborhood of the union 
of the other two sides. A metric space is hyperbolic if it is ^-hyperbolic for some 5. 

One sometimes says that geodesic triangles are 6-thin. 
Example 2.2.2. A tree is 0-hyperbolic. 

Example 2.2.3. Hyperbolic space (of any dimension) is J-hyperbolic for a uniform 
S. 

Example 2.2 A. If X is a simply-connected complete Riemannian manifold with cur- 
vature bounded above by some K <0 then X is 5-hyperbolic for some 6 depending 
on K. 

Definition 2.2.5. A geodesic metric space X is CAT(/C) for some K if triangles 
are thinner than comparison triangles in a space of constant curvature K. This 
means that if xyz is a geodesic triangle in X, and x'y'z' is a geodesic triangle in a 

complete simply connected Riemannian manifold Y of constant curvature K with 
edges of the same lengths, and (p : xyz — )■ x'y'z' is an isometry on each edge, then 
for any w G yz we have dx{x, w) < dyix', (f){w)). 



6 



DANNY CALEGARI 




Figure 2. A J-thin triangle; the gray tubes have thickness S. 

The initials CAT stand for Cartan-Alexandrov-Toponogov, who made substan- 
tial contributions to the theory of comparison geometry. 

Example 2.2.6. From the definition, a CAT{K) space is (5-hyperbolic whenever the 
complete simply connected Riemannian 2-manifold of constant curvature K is 6- 
hyperbolic. Hence a CAT{K) space is hyperbolic if K <0. 

Example 2.2.7. Nearest point projection to a convex subset of a CAT{K) space 
with ii' < is distance nonincreasing. Therefore the subspace metric and the path 
metric on a convex subset of a CAT(i^) space agree, and such a subspace is itself 
CAT{K). 

Thinness of triangles implies thinness of arbitrary polygons. 

Example 2.2.8. Let X be (5- hyperbolic and let abed be a geodesic quadrilateral. 
Then either there are points on ab and cd at distance < 26 or there are points on 
ad and be at distance < 26, or possibly both. 




Figure 3. Two ways that a quadrilateral can be thin 
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The number of essentially distinct ways in which an n-gon can be thin is equal 
to the nth Catalan number. By cutting up a polygon into triangles and examining 
the implications of 5-thinness for each triangle, one can reason about the geometry 
of complicated configurations in (5-hyperbolic space. 

Lemma 2.2.9. Let X be 5-hyperbolic, let -f be a geodesic segment/ray/line in X, 
and let p € X. Then there is a point q on j realizing the infimum of distance from 
p to points on 7, and moreover for any two such points q, q' we have dx{q, q') < 4(5. 

Proof. The existence of some point realizing the infimum follows from the proper- 
ness of d{p, •) : 7 — > R, valid for any geodesic in any metric space. 

Let q,q' be two such points, and if d{q,q') > AS let q" be the midpoint of the 
segment qq' , so d(q,q") = d(q",q') > 2S. Without loss of generality there is r on 
pq with d{r, q") < 6 hence d{r, q) > 6. But then 

d{p, q") < d{p, r) + d(r, q") < d{p, r) + S < d{p, r) + d{r, q) = dip, q) 

contrary to the fact that q minimizes the distance from p to points on 7. □ 

Lemma 2.2.9 says that there is an approximate "nearest point projection" map tt 

from X to any geodesic 7 (compare with Example 2.2.7). This map is not continu- 
ous, but nearby points must map to nearby points, in the sense that d{Tr{x),Tr{y)) < 
d{x,y) + S6. 

We would now like to show that the property of being hyperbolic is preserved un- 
der quasi-isomctry. The problem is that the property of (5-hypcrbolicity is expressed 
in terms of geodesies, and quasi-isometries do not take geodesies to geodesies. 

A quasigeodesic segment /ray/line is the image of a segment /ray /line in K under 
a quasi-isometric map. For infinite or semi-infinite intervals this definition has 
content; for finite intervals this definition has no content without specifying the 
constants involved. Hence we can talk about a,K,€ quasigeodesic segment/ray/line. 

Lemma 2.2.10 (Morse lemma). Let X,dx be a proper 5-hyperbolic space. Then 

for any K, e there is a constant C (depending in an explicit way on K,e,5) so that 
any K, e quasigeodesic 7 is within Hausdorff distance C of a genuine geodesic 7^ . 
If J has one or two endpoints, 7^ can be chosen to have the same endpoints. 

Proof. If 7 is noncompact, it can be approximated on compact subsets by finite 

segments 7^. If we prove the lemma for finite segments, then a subsequence of the 
7?, converging on compact subsets, will limit to 7^ with the desired properties (here 
is where we use properness of X). So it suffices to prove the lemma for 7 a segment. 

In this case choose any 7^ with the same endpoints as 7. We need to estimate 
the Hausdorfl: distance from 7 to 7^. Fix some constant C and suppose there are 
points p,p' on 7 that are both distance C from 7^, but d{r,^^) > C for all r on 7 
between p and p' . Choose pi a sequence of points on 7 and qi a sequence of points 
on 7^ closest to the pi so that d{qi, gi+i) = 115. 

Consider the quadrilateral piPj+igj+iQ'i. By Example 2.2.8 either there are close 
points on PiPi+i and gi^i+i, or close points on piqi and pi+ig^+i (or possibly both). 
Suppose there are points on piqi and rj+i on pj+i 1 with d{ri , rj+i ) < 25. Then 
any nearest point projections of and r^+i to 7^ must be at most distance 105 
apart. But qi and qi+i are such nearest point projections, by definition, and satisfy 
d{qi,qi+i) = 115. So it must be instead that there are points on PiPi+i and Sj 
on qiqi+i which are at most 25 apart. But this means that d{pi,pi+i) > 2C — 45, 
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SO the length of 7 between p and p' is at least (2C — 4)(i(g, where g, 5' are 

points on 7 closest top,p'. On the other hand, d{p,p') < 2C + d{q,q'). Since 7 is a 
K, € quasigeodesic, if d{q, q') is big enough, we get a uniform bound on C in terms 
of K, €, 5. The remaining case where d{q, q') is itself uniformly bounded but C is 
unbounded quickly leads to a contradiction. □ 

Corollary 2.2.11. Let Y he 6-hyperbolic and let f : X ^ Y be a K,e quasi- 
isonietry. Then X is S' -hyperbolic for some S. Hence the property of being hyperbolic 

is a quasi-isometry invariant. 

Proof. Let F be a geodesic triangle in X with vertices a, &, c. Then the edges of 
/(r) are K, e quasigeodesics in Y, and are therefore within Hausdorff distance C of 
geodesies with the same cndpoints. It follows that every point on f{ah) is within 
distance 2C + 5 of /(ac) U f{hc) and therefore every point on ah is within distance 
K{2C + 5) + e of ac U he. □ 

The Morse Lemma lets us promote quasigeodesics to (nearby) geodesies. The 
next lemma says that quasigeodesity is a local condition. 

Definition 2.2.12. A path 7 in X is a k-local geodesic if the subsegments of length 
< k are geodesies. Similarly, 7 is a k-local K, e quasigeodesic if the subsegments of 
length < k are K, e quasigeodesics. 

Lemma 2.2.13 (A;-local geodesies). Let X be a d-hyperbolic geodesic space, and let 
k > 86. Then any k-local geodesic is K, e quasigeodesic for K, e depending explicitly 
on S. 

More generally, for any K, e there is ak and constants K' , e' so that any k-local 
K,e quasigeodesic is a K',e' quasigeodesic. 

Proof. Let 7 be a fc-local geodesic segment from p to q, and let 7^ be any geodesic 

from p to q. Let r be a point on 7 furthest from 7^, and let r be the midpoint of 
an arc r'r" of 7 of length 86. By hypothesis, r'r" is actually a geodesic. Let s' and 
s" be points on 7^ closest to r' and r". The point r is within distance 26 either 
of 7^ or of one of the sides r's' or r"s" . If the latter, we would get a path from r 
to s' or s" shorter than the distance from r' or r", contrary to the definition of r. 
Hence the distance from r to 7^ is at most 2^, and therefore 7 is contained in the 
26 neighborhood of 7^. 

Now let TT : 7 — >■ 7^ take points on 7 to closest points on 7^. Since tt moves points 
at most 26, it is approximately continuous. Since 7 is a fc-local geodesic, the map 
TT is approximately monotone; i.e. ii pi arc points on 7 with d{pi,pi+i) = k moving 
monotonely from one end of 7 to the other, then d{Tr{pi), 7r(pj+i)) > k — 46 and the 
projections also move monotonely along 7. In particular, d{pi,pj) > {k — 45)|i — j\ 
and TT is a quasi-isomctry. The constants involved evidently depend only on 6 and 
fc, and the multiplicative constant evidently goes to 1 as fc gets large. 

The more general fact is proved similarly, by using Lemma 2.2.10 to promote 
local quasigeodesics to local geodesies, and then back to global quasigeodesics. □ 

2.3. Hyperbolic groups. Corollary 2.2.11 justifies the following definition: 

Definition 2.3.1. A group G is hyperbolic if Cs{G) is 5-hyperbolic for some 6 for 
some (and hence for any) finite generating set S. 
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Example 2.3.2. Free groups are hyperbolic, since their Cayley graphs (with respect 
to a free generating set) are trees which are 0-hyperbohc. 

Example 2.3.3. Virtually free groups, being precisely the groups quasi-isometric to 
trees, are hyperbolic. A group quasi-isometric to a point or to R is finite or virtually 
Z respectively; such groups are called elementary hyperbolic groups; all others are 
nonelem,entary. 

Example 2.3.4. Fundamental groups of closed surfaces with negative Eulcr charac- 
teristic are hyperbolic. By the uniformization theorem, each such surface can be 
given a hyperbolic metric, exhibiting tti as a cocompact group of isometrics of the 
hyperbolic plane. 

Example 2.3.5. A Kleinian group is a finitely generated discrete subgroup of the 
group of isometrics of hyperbolic 3-space. A Kleinian group G is is convex cocompact 
if it acts cocompactly on the convex hull of its limit set (in the sphere at infinity). 
Such a convex hull is CAT(— 1), so a convex cocompact Kleinian group is hyperbolic. 
See e.g. [28] for an introduction to Kleinian groups. 

Lemma 2.3.6 (invariant quasiaxis). Let G he hyperbolic. Then there are finitely 
many conjugacy classes of torsion elements ( and therefore a hound on the order of 
the torsion) and there are constants K, e so that for any nontorsion element g there 
is a K,e quasigeodesic 7 invariant under g on which g acts as translation. 

Proof. Let g € G he given. Consider the action of g on the Cayley graph Cs{G). 

The action is simplicial, so p — > d(p, gp) has no strict local minima in the interior 
of edges, and takes integer values at the vertices (which correspond to elements of 
G). It follows that there is some h for which d{h,gh) is minimal, and we can take 
h to be an clement of G (i.e. a vertex). If d{h,gh) = k > 8S then we can join h to 
gh by a geodesic a and let 7 = U^gV. Note that g acts on 7 by translation through 
distance k; since this is the minimum distance that g moves points of G, it follows 
that 7 is a fc-local geodesic (and therefore a K,e quasigeodesic by Lemma 2.2.13). 
Note in this case that g has infinite order. 

Otherwise there is h moved a least distance by g so that d{h,gh) < 86. Since 
G acts cocompactly on itself, there arc only finitely many conjugacy classes of 
elements that move some point any uniformly bounded distance, so if g is torsion 
we are done. If g is not torsion, its orbits are proper, so for any T there is an 
N so that d{h,g^h) > T; choose T (and N) much bigger than some fixed (but 
big) n. Let 7 be a geodesic from h to g^h. Then for any < i < n the geodesic 
5*7 has endpoints within distance 8Sn of the endpoints of 7. On the other hand, 
I7I = T 85n so 7 contains a segment a of length at least T — 16Sn — 0{S) such 
that (?V is contained in the 26 neighborhood of 7 for < z < n. To see this, 
consider the quadrilateral with successive vertices h, g^h, g'^^^h and g'^h. Two 
nonadjaccnt sides must contain points which are at most 26 apart. Since N ^ i, 
the sides must be 7 and (7*7. We find a and g^a in the region where these two 
geodesies are close. 

Consequently, for any p G cr the sequence p, gp, • • • , g"p is a K,e quasigeodesic 
for some uniform K, e independent of n. In particular there is a constant C (in- 
dependent of n) so that d{p, g'^p) > iC for < i < n, and therefore the infinite 
sequence g^p for i G li is an (nC)-local e quasigeodesic. Since K, e is fixed, if n is 
big enough, this infinite sequence is an honest K', e' quasigeodesic invariant under 
g, by Lemma 2.2.13. Here K', e' depends only on i5 and G, and not on g. □ 
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Lemma 2.3.6 can be weakened considerably, and it is frequently important to 
study actions which are not necessarily cocompact on (5-hyperbolic spaces which are 
not necessarily proper. The quasigeodesic 7 invariant under g is called a quasiaxis. 
Quasiaxes in (5-hyperbolic spaces are (approximately) unique: 

Lemma 2.3.7. Let G be hyperbolic, and let g have infinite order. Let 7 and 7' 

be g-invariant K.e quasigeodesics (i.e. quasiaxes for g). Then 7 a,nd 7' are a 
finite Hausdorff distance apart, and this finite distance depends only on K, e and S. 
Consequently the centralizer C{g) is virtually Z. 

Proof. Let p G 7 and p' e 7' a closest point to p. Since g acts on both 7 and 7' 
cocompactly, there is a constant C so that every point in 7 or 7' is within C from 
some point in the orbit of p ov p'. This implies that the Hausdorff distance from 7 
to 7' is at most 2C + d{p,p'); in particular, this distance is finite. 

Pick two points on 7 very far away from each other; each is distance at most 
2C + d{p,p') from 7', and therefore most of the geodesic between them is within 
distance 25 of the geodesic between corresponding points on 7'. But 7 and 7' 
arc themselves K, e quasigeodesic, and therefore uniformly close to these geodesies. 
Hence some points on 7 are within a uniformly bounded distance of 7', and therefore 
all points on 7 are. 

If h commutes with 17, then h must permute the quasiaxes of g. Therefore h 
takes points on any quasiaxis 7 for g to within a bounded distance of 7. Hence 
C{g), thought of as a subset of G, is quasiisometric to a quasiaxis (that is to say, 
to M), and is therefore virtually Z. □ 

This shows that a hyperbolic group cannot contain a copy of Z ® Z (or, for that 
matter, the fundamental group of a Klein bottle). This is more subtle than it might 
seem; Z ® Z can act freely and properly discontinuously by isometrics on a proper 
5-hyperbolic space — for example, as a parabolic subgroup of the isometries of H^. 

Example 2.3.8. If M is a closed 3-manifold, then tyi{M) is hyperbolic if and only if 
it does not contain any Z © Z subgroup. Note that this includes the possibility that 
7ri(M) is elementary hyperbolic (for instance, finite). This follows from Perelman's 
Geometrization Theorem [31, 32]. 

If g is an isomctry of any metric space X , the translaMon length of g is the limit 
T{g) := lim„_^oo dx{p,g^p)ln for some p & X. The triangle inequality implies that 
the limit exists and is independent of the choice of p. Moreover, from the definition, 
''"(5") = s-iid T(g) is a conjugacy invariant. 

Lemma 2.3.6 implies that for G acting on itself, T{g) = if and only if g has 
finite (and therefore bounded) order. Consequently a hyperbolic group cannot 
contain a copy of a Baumslag-Solitar group; i.e. a group of the form BS{p, q) := 
{a,b I baPb~^ = a'). For, we have already shown hyperbolic groups do not contain 
Z ® Z, and this rules out the case \p\ = \q\, and if \p\ ^ \q\ then for any isometric 
action of BS{p, q) on a metric space, r(a) = 0. 

By properness of Cs{G) and the Morse Lemma, there is a constant N so that 
for any g G G the power g^ has an invariant geodesic axis on which it acts by 
translation. It follows that T{g) G Q, and in fact G -^Z; this cute observation is 
due to Gromov [20]. 

2.4. The Gromov boundary. Two geodesic rays 7,7' in a metric space X are 
asymptotic if they are a finite Hausdorff distance apart. The property of being 
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asymptotic is an equivalence relation, and the set of equivalence classes is the 
Gromov boundary, and denoted dooX. If X is proper and J-hypcrbolic, and x is 
any basepoint, then every equivalence class contains a ray starting at x. For, if 7 
is a geodesic ray, and 5i G 7 goes to infinity, then by properness, any collection of 
geodesies xgi contains a subsequence which converges on compact subsets to a ray 
7'. By 5-thinness each of the triangles xgoQi is contained in a uniformly bounded 
neighborhood of 7, so the same is true of 7'; in particular, 7' is asymptotic to 
7. We give dooX the topology of convergence on compact subsets of equivalence 
classes. That is, 7^ ^ 7 if and only if every subsequence of the ji contains a 
further subsequence whose equivalence classes have representatives that converge 
on compact subsets to some representative of the equivalence class of 7. 

Lemma 2.4.1. Let X be a 6-hyperbolic proper geodesic metric space. Then dooX 
is compact. 

Proof. If 7i is any sequence of rays, and 7^' is an equivalent sequence starting at a 
basepoint x, then by properness 7- has a subsequence which converges on compact 
subsets. □ 



In fact, we can define a (compact) topology on X := X L) d^oX by saying that 
a;i 7 if and only if every subsequence of a sequence of geodesies xxi contains a 
further subsequence which converges on compact subsets to a representative of 7. 
With this topology, X is compact, dooX is closed in X, and the inclusion of X into 
X is a homeomorphism onto its image. 

A bi-infinite geodesic 7 determines two (distinct) points in dooX] we call these 
the endpoints of 7. Two geodesies with the same (finite or infinite) endpoints are 
HausdorfF distance at most 25 apart. Conversely, any two distinct points in d^jX 
are spanned by an infinite geodesic 7. For, if 71, 72 are two infinite rays (starting at 
a basepoint x for concreteness), and gi, hi are points on 71, 72 respectively going to 
infinity, some point Pi on any geodesic gih^ is within 5 of both xgi and xhi, and if 
Pi — >■ 00 then 7i and 72 would be a finite Hausdorff distance apart. Otherwise some 
subsequence of the pi converges to p, and the geodesies gihi converge on compact 
subsets to a (nonempty!) bi-infinite geodesic 7 through p asymptotic to both 71 and 
72. Evidently, geodesic triangles with some or all endpoints at infinity are J'-thin 
for some 5' depending only on i5 (one can take 5' = 205). By abuse of notation, in 
the sequel we will call a metric space ^-hyperbolic if all geodesic triangles — even 
those with some endpoints at infinity — are 5-thin. 

Let X, Y be hyperbolic geodesic metric spaces. Then any quasi-isometric map 
(j) : X ^ Y extends uniquely to a continuous map daoX daoY ■ In particular, 
the Gromov boundary d^aX depends (up to homeomorphism) only on the quasi- 
isometry type of X, and QI(X) acts on d^^X by homeomorphisms. 

If G is a hyperbolic group, we define dooG to be the Gromov boundary of some 
(any) Cs{G). 

Example 2.4.2. If G is free, d^oG is a Cantor set. If G is a tti of a closed surface 
with negative Euler characteristic, d^oG is a circle. If G is a convex cocompact 
Kleinian group, dooG is homeomorphic to the limit set. For example, if G is the 
fundamental group of a hyperbolic 3- manifold with totally geodesic boundary, dooG 
is a Sierpinski carpet. 
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Figure 4. The Sierpinski carpet and the Menger sponge. 



In fact, a theorem of Kapovich-Kleiner [25] says that if G is a hyperbolic group 
which does not spht over a finite or virtually cyclic subgroup, and if dooG is 1- 
dimensional (in the topological sense of dimension), then d^oG is homeomorphic to 
the circle, the Sierpinski carpet, or the Menger sponge. 

Evidently, d^oG is empty if and only if G is finite, and if d^oG is nonempty, it 

has at least two points, and has exactly two points if and only if G is itself quasi- 
isometric to the geodesic joining these two points, which holds if and only if G is 
virtually Z. 

If ,9 G G has infinite order, a quasiaxis 7 is asymptotic to two points G docG. 
Under (positive) powers of g, points stay a constant distance from 7, and move 
towards one of the endpoints, say p"*". As homeomorphisms from X to itself, the 
elements (/" with n — > 00 converge uniformly (in the compact-open topology) on 
X —p~ to the constant map to p'^ . We call p"*" the attracting endpoint and p~ the 
repelling endpoint of g; the actions of g on dooG is sometimes expressed by saying 
that it has source-sink dynamics. 

Example 2.4.3. Let g,h G G he of infinite order, with quasiaxes 7 and 7'. If 7 and 

7' share an endpoint (without loss of generality the attracting endpoint of each) 
and p is close to both 7 and 7', then there are rij, mj — ^ 00 for which d{h~'^^ g"'p, p) 
is bounded. Since the action of G on its Cay ley graph is properly discontinuously, 
it follows that there are distinct j with h~'^'g'^' = h~"^^g^^ so that /i™ = for 
some positive n, m. In particular, in this case g and h together generate a virtual Z 
subgroup, and their quasiaxes have the same endpoints. Otherwise the endpoints 
are disjoint, and because of the source-sink dynamics, Klein's pingpong argument 
implies that sufficiently large powers 5", /i™ generate a (nonabelian) free subgroup 
of G. 

Lemma 2.4.4. Suppose G is nonelementary. Then the action of G on dooG is 
minimal; i.e. every orbit is dense. Consequently d^oG is infinite and perfect. 
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Proof. If G is nonelementary, there are g, h whose quasiaxes have distinct endpoints 
and respectively. If r e dooG is arbitrary, then either g^r — >■ p"*" or g^hr — >■ 
it follows that every attracting/repelling point is in the closure of every orbit. 
Now let 7 be a geodesic from p~ to p+ and let 7' be a geodesic ray asymptotic 
to r. Pick s on 7 and let gt be a sequence of elements with gi{s) G 7' converging 
to r. At most one component of 7 — s can come close to the basepoint x. Hence 
there is some subsequence so that either gip'^ — >• r or gip~ — >• r, and therefore every 
point is in the closure of the orbit of some attracting/repelling point. This proves 
the lemma. □ 

Another way to see the compactification dooX is in terms of (equivalence classes 

of) horofunctions. 

Definition 2.4.5 (horofunction) . Let 7 be a geodesic ray parameterized by length. 
The horofunction (also called the Busemann function) associated to 7 is the limit 

h^{x) := lim djc (a:,7(t)) — t 
The level sets of horofunctions arc called horospheres. 

This limit exists and is finite, by the triangle inequality. Moreover, it is 1- 
Lipschitz. If 7 and 7' are asymptotic, then there is some constant C{-f, 7') so that 
\{bj — b^i) — C\ < 2(5. If X is the endpoint of 7, we let b^: denote any horofunction 
of the form 6-y, and say that b^ is centered at x. 

Here is another way to define 6-y without reference to 7. On any proper metric 
space, the set of 1-Lipschitz functions mod constants is compact (in the topology 
of convergence on compact subsets). For any x G X the function dx{x,-) : A — >• M 
is 1-Lipschitz, and x — >■ dx{x,-) embeds X in the space of 1-Lipschitz functions 
on X mod constants. The closure of this image defines a natural compactification 
of X; quotienting further by bounded functions gives X. For each x G dooX the 
preimage is the set of equivalence classes of functions b^- In this way we think of 
bx as a normalization of the function which measures "distance to x" . 

The space dooX can be metrized following Gromov (see [20]). 

Definition 2.4.6. Fix some basepoint x and some constant a > 1. The a-length 
of a rectifiablc path 7 in A is the integral along 7 of a~''^'^^' \ and the a-distance 
from y to z, denoted dx{y, z), is the infimum of the a-lengths of paths between y 
and z. 

A straightforward calculation shows that there is an oq > 1 so that for a < uq, 
the a-length defines a metric on A. In fact, any ao with (51og(ao) ^ 1 will work. If 
a is too big, a-length still extends to a pseudo-metric on A, but now distinct points 
of dacX might be joined by a sequence of paths with a-length going to 0. Increasing 
a will decrease the Hausdorff dimension of 9oo A; of course, the Hausdorff dimension 
must always be at least as big as the topological dimension. In any case, it follows 
that dooX is mctrizable. 

The following lemma is useful to compare length and a-length. 

Lemma 2.4.7. For a < ao there is a constant A so that for all points y,z £ dooX 
there is an inequality A~^a~''^^'^'^^^ < dx{y,z) < Aa"'^^^^'^^) where dx{x,yz) is 
the ordinary distance from the basepoint x to the geodesic yz. 
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For a proof, sec [13]. 

The quantity dx{x,yz) is sometimes abbreviated by {y\z) (the basepoint x is 
suppressed in this notation), and called the Gromov product. So we can also write 
A^^a^(2/k) < d\{y,z) < Aa"^^'^). Because of this inequality, different choices of a 
give rise to Holder equivalent metrics on d^oX. If X is a group G, we take id as 
the basepoint, by convention. 

Remark 2.4.8. With our notation, {y\z) := dx{x,yz) is ambiguous, since it depends 
on a choice of geodesic from y to z. Since we only care about {y\z) up to a uniform 
additive constant, we ignore this issue. One common normalization, adopted by 
Gromov, is to use the formula {y\z) := ^{dxix,y) + dx{x,z) — dx{y,z)). These 
definitions are interchangeable for our purposes, as the ambiguity can always be 
absorbed into some unspecified constant. 

A group G acting by homeomorphisms on a compact metrizable space M is said 
to be a convergence action if the induced action on the space — A of distinct 
ordered triples is properly discontinuous. 

Lemma 2.4.9. The action of G on d^oG is a convergence action. Moreover, the 
action on the space of distinct triples is cocompact. 

Proof. If X, y, z is a distinct triple of points in dooG, there is a point p within 
distance 5 of all three geodesies xy,yz,zx; moreover, the set of such points has 
uniformly bounded diameter in G. This defines an approximate map from distinct 
triples to points in G. Since the action of G on itself is cocompact, the same is true 
for the action on the space of distinct triples. Similarly, if the action of G on the 
space of distinct triples were not properly discontinuous, we could find two bounded 
regions in G and infinitely many gi in G taking some point in one bounded region 
to some point in the other, which is absurd. □ 

The converse is a famous theorem of Bowditch: 

Theorem 2.4.10 (Bowditch's convergence theorem [3] Thm. 0.1). Let G act faith- 
fully, properly dis continuously and cocompactly on the space of distinct triples of 
some perfect compact metrizable .space M. Then G is hyperbolic and M is G- 

equivariantly homeomorphic to dooG. 

2.5. Patterson— Sullivan measure. The results in this section are due to Coor- 
naert [12] , although because of our more narrow focus we are able to give somewhat 

different and shorter proofs. However by and large our proofs, like Coornaert's, are 
obtained by directly generalizing ideas of Sullivan [43] in the context of Kleinian 
groups. 

Let G be a hyperbolic group, and let G'<„ denote the set of elements of (word) 
length < n, with respect to some fixed generating set. The critical exponent h{G) 
(also called the volume entropy of G) is the quantity 



in other words, the exponential growth rate of G. Since every nonelementary hy- 
perbolic group contains many free groups (Example 2.4.3), h{G) = if and only if 
G is elementary. 



h{G) := limsup — log |G<„| 



n— >oo 
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Define the (Poincare) zeta function by the formula 

geG 

Then Cg{s) diverges if s < h{G) and converges if s > h{G). 
Lemma 2.5.1. The zeta function diverges at s = h{G). 

Proof. We will show in § 3 (Theorem 3.2.2) that for any hyperbolic group G and any 
generating set S there is a regular language L C S* consisting of geodesies, which 
evaluates bijectively to G. In particular, |G'<„| = |L<„| for any n. In any regular 
language L the generating function ^ |L<„|t" is rational (Theorem 3.1.3); i.e. it 
is the power series expansion of p{t)/q{t) for some integral polynomials p, g, and 
consequently C~^{e^^n^) < |L<„| < C{e^^n^) for some real h and non-negative 
integer k, and constant C. Evidently, for L as above, h = h{G) and the zeta 
function diverges at h. □ 

For s > h(G) construct a probability measure Vg on G (i.e. on G U dooG) sup- 
ported in G, by putting a Dirac mass of size e~'^^^^ /(cis) at each g G G. As s 
converges to h from above, this sequence of probability measures contains a sub- 
sequence which converges to a limit i>. Since the zeta function diverges at h, the 
limit v is supported on dG. This measure is called a Patterson-Sullivan measure, 
by analogy with the work of Patterson and Sullivan [30, 43] on Kleinian groups. 

For any g, the pushforward of measure g^fs is defined by g^VsiA) = i's{g~^A), 
and similarly for g^.v. For any g, g' there is an inequality \g'\ — \g\< \gg'\ < \g'\ + \g\- 
Prom the definition on Vg, this implies that g^.i's is absolutely continuous with 
respect to Vg, and its Radon-Nikodym derivative satisfies e^^'^l < d{g*i's)/di^s < 
e^l^l. Passing to a limit we deduce that e^'''^' < d{g,ti')/dv < e'''^'. 

The most important property of the measure v is a. refinement of this inequality, 
which can be expressed by saying that it is a so-called quasiconformal measure of 
dimension h. The "conformal" structure on dooX is defined using the a-distance 
for some fixed a > 1 (recall Definition 2.4.6). 

Definition 2.5.2 (Coornaert). For 5 € G define jg : d^oX — )• M by 

jg{y) = (id) -6.(3) 

for some horofunction by centered at y. A probability measure u on dooX is a 
quasiconformal measure of dimension D itg^u is absolutely continuous with respect 
to v for every g G G, and there is some constant G independent of g so that 

G-'jgiy)" < d{g.v)ldv < Gjg{y f 

Notice that the ambiguity in the choice of horofunction by is absorbed into the 

definition of jg (which only depends on by mod constant functions) and the constant 
G. The support of any quasiconformal measure is evidently closed and G-invariant, 
so by Lemma 2.4.4, it is all of dooG. 

From the definition of the Radon Nikodym derivative, u is a quasiconformal 
measure of dimension D if there is a constant G so that for all y we can find a 
neighborhood ^ of y in X for which 

C-'jgiy)''iy{A) < i^ig-'A) < Cjg{y)''uiA) 

for all AcV. 
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Remark 2.5.3. For some reason, Coornaert chooses to work with pullbacks of mea- 
sure g*!^ := g^^v instead of pushforward. Therefore the roles of g and g~^ are 
generally interchanged between our discussion and Coornaert 's. 

Theorem 2.5.4 (Coornaert [12] Thm. 5.4). The measure v is a quasiconformal 

measure of dimension D where D — h/\oga. 

Proof. Evidently the support of ly is G-invariant, and is therefore equal to all of 
daoG. Let y G dooX, let by be a horofunction centered at y, and let g G G. 

By 5-thinness and the definition of a horofunction, d{g, z) — d{id, z) is close to 
by{g) — by (id) for z sufficiently close to y. In particular, there is a neighborhood V 
of 2/ in X so that 

\g-^z\ - 1^1 - C < by{g) - by{id) < \g-''z\ -\z\+C 

for some C, and for all zinV. 

For each s > ft, we have g,:Vs{z) /vs{z) = Vs{g~'^z)/i's{z) — e"***^'® Taking 
the limit as s — ^ /i and defining £> by o''^ = e^ proves the theorem. □ 

To make use of this observation, we introduce the idea of a shadow, following 
Sullivan. 

Definition 2.5.5. For g G G and R a positive real number, the shadow S{g,R) is 
the set of y e dooG such that every geodesic ray from id to y comes within distance 
Roig. 

Said another way, y is in S{g, R) if g comes within distance R of any geodesic 
from id to y. Given R > 26, for any fixed n the shadows S{g, R) with l^l = n cover 
dooG efficiently: 

Lemma 2.5.6. Fix R. Then there is a constant N so that for any y € d^oG and 
any n there is at least 1 and there are at most N elements g with \g\ = n and 
yGS{g,R). 

Proof. If i? > 2(5, if 7 is any geodesic from id to y, and if g is any point on 7, 
then y e S{g,R). Conversely, if g and h are any two elements with \g\ = \h\ and 
y e S{g, R) n S{h, R) then d{g, h) < 2R. □ 

Sullivan's fundamental observation is that the action of g~^ on S{g, R) is uni- 
formly close to being linear, in the sense that the derivative d{gt,v)/dv varies by a 
bounded multiplicative constant on S{g,R): 

Lemma 2.5.7. Fix R. Then there is a constant C so that for any y £ S{g,R) 
there is an inequality 

C-^alsl < jg{y) < Calsl 

Proof. Recall jg{y) = a^3'(''*)~''!'(s) for some horofunction by. But by 5-thinness and 
the definition of a shadow, there is a constant C" so that 

\g\-C' <by{id)-by{g)<\g\ + C' 

for any y in S{g, R). □ 

Prom this one readily obtains a uniform estimate on the measure of a shadow: 
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Lemma 2.5.8. Fix R. Then there is a constant C so that for any g ^ G there is 
an inequality 

C-i^-lsl-D < u{S{g,R)) < Ca-lsl-° 

Proof. Let mo < 1 be the measure of the biggest atom of and fix tuq < m < 1. 
By compactness of dooG there is some e so that every ball in dooG of diameter 
< e has mass at most m. Now, g~^S{g, R) is the set of y e dooG for which 
every geodesic ray from to y comes within distance R of id. As i? — > oo, the 
diameter of dooG — g~^S{g,R) goes to zero uniformly in g (this follows from the 
quasi-equivalence of (y, z) and a-(fl^); see Lemma 2.4.7). Consequently there is 
some Ro so that for all R > Rq the measure u{g~^S{g,R)) is between 1 — m and 
1, independent of g. 

Now, by Lemma 2.5.7 and the definition of a quasiconformal measure, there is a 
constant Ci so that 

Cr^alsl^ < uig-'S{g,R))/uiS{g,R)) < Cial^l^ 

Taking reciprocals, and using the fact that 1 — m < i'{g~^S{g,R)) < 1 completes 
the proof (at the cost of adjusting constants). □ 

Note that the argument shows that v has no atoms, since any y e d^oG is 
contained in some shadow of measure < Ca~^"' for any n. We deduce the following 
corollary. 

Corollary 2.5.9 (Coornaert [12] Thm. 7.2). Let G be a hyperbolic group. Then 
there is a constant G so that 

G-'^e''" < \G<n\ < Ce''" 

for all n. 

Proof. The lower bound is proved in § 3, so we just need to prove the upper bound. 

For each g with \g\ = n Lemma 2.5.8 says e"'*" = a~^" < G\i'{S{g, R)). On 
the other hand. Lemma 2.5.6 says that every point y e d^oG is in at most A'' sets 
S{g,R) with \g\ = n, so 

\Gn\e-^^C^' < J2 ''iS{g,R)) < A^KU|3|=„5(5,ii)) = N 

lsl=n 

□ 

Corollary 2.5.9 has important consequences that we will explore in § 3. 

A second corollary gives very precise metric and dynamical control over dooG. 
An action of a group G on a space X is said to be ergodic for some measure v on 
X if for any two subsets A, B oi X with i^{A), v{B) > there is some g G G with 

Kff(^)nB) >o. 

Corollary 2.5.10 (Coornaert [12] Cor. 7.5 and Thm. 7.7). Let u be a quasiconfor- 
mal measure on dooG of dimension D. Then v is quasi- equivalent to D -dimensional 
Hausdorff measure; i.e. there is a constant C so that C^^ (A) < i'{A) < 
CH^{A) for any A. In particular, the space dooG has Hausdorff dimension D, 
and its D-dimensional Hausdorff measure is finite and positive. Moreover, the ac- 
tion of G on dG is ergodic for v. 
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Proof. Evidently, the second and third claims follow from the first (if A is a, G- 
invariant subset of dooG of positive j/-measure, the restriction of to ^ is a qua- 
siconformal measure of dimension D, and is therefore quasi-equivalent to and 
thence to f. In particular, A has full measure). So it suffices to show that f and 
arc quasi-equivalent. 

Since Cf ^a'^^l^) < z) < Cia-^^'l^) it follows that every metric bah B{y, r) 

in dooG can be sandwiched between two shadows S{gi,R) c B{y,r) C S{g2,R) 
with a~l3i' > r/C2 and a"l9=^l < rC2- From Lemma 2.5.8 we obtain C^^V^ < 
v{B{y,r)) < C2r^ ■ From this and the definition of Hausdorff measure, we will 
obtain the theorem. 

If A is any measurable set, cover A by balls Ui of radius Cj < e. Then 

i 

so letting e -j> we get u{A) < G2H^{A). 

The following proof of the reverse inequality was suggested to us by Curt Mc- 
Mullcn. For any S let K be compact and U open so that K C A C U and both 
^{U — K) and H^{U — K) are less than 5. By compactness, there is an e so that 
every ball of radius < e centered at a point in K is contained in U. Now inductively 
cover K by balls I7i, f/2, • • • of non- increasing radius < e in such a way that the 
center of each Ui is not in Uj for and j < i. Then the balls with the same centers 
and half the radii are disjoint, so 

J2er = 2^j:{e./2)^<Csu{U) 

and therefore F^(i^) < Cs^U). Taking 6^0 gives iJ^(A) < CsiyiA) and we are 
done. □ 

Remark 2.5.11. Coornaert only gives the proof of the inequality i^{A) < CH^{A) in 
his paper, referring the reader to Sullivan [43] for the proof of [A) < 

However, there is a gap in Sullivan's proof of the reverse inequality, of which the 
reader should be warned. 

Remark 2.5.12. The approximate linearity of on S{g,R) has many other ap- 
plications. For example, see the proof of Theorem 1 in [42]. 

3. Combings 

On a Riemannian manifold, a "geodesic" is just a smooth path that locally min- 
imizes length (really, energy). A sufficiently long geodesic is typically not globally 
length minimizing, and the entire subject of Morse theory is devoted to the dif- 
ference. By contrast, one of the most important qualitative features of negative 
curvature is that (quasi)-geodesity is a local property (i.e. Lemma 2.2.13). 

This localness translates into an important combinatorial property, known tech- 
nically as finiteness of cone types. This is the basis of Cannon's theory of hyperbolic 
groups, and for the more general theory of automatic groups and structures (see 
[15] for more details). 

3.1. Regular languages. Let 5 be a finite set, and let S* denote the set of finite 

words in the alphabet 5*. An automaton is a finite directed graph F with a distin- 
guished initial vertex, and edges labeled by elements of S in such a way that each 
vertex has at most one outgoing edge with a given label. Some subset of the vertices 
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of r are called accept states. A word w is S* determines a simplicial path in F, by 
starting at the initial vertex, by reading the letters of w (from left to right) one by 
one, and by moving along the corresponding edge of F if it exists, or halting if not. 
Associated to F there is a subset L c S* consisting of precisely those words that 
can be read in their entirety without halting, and for which the terminal vertex of 
the associated path ends at an accept state. One says that L is parameterized by 
(paths in) F. 

Definition 3.1.1. A subset L C S'* is a regular language if there is a finite directed 
graph F as above that parameterizes L. 

Note that F is not part of the data of a regular language, and for any given 
regular language there will be many graphs that parameterize it. A language is 
prefix- closed if, whenever w ^ L, every prefix of w is also in L (the empty word is 
a prefix of every word). 

Lemma 3.1.2. If L is prefix-closed and regular, there is aT parameterizing L for 
which every vertex is an accept state. 

Proof. If F is any graph that parameterizes L, remove all non-accept vertices and 

the edges into and out of them. □ 

Theorem 3.1.3 (Generating function). Let L be a regular language, and for each 
n, let Ln denote the set of elements of length n, and L<n the set of elements of 

length < n. Let s{t) := and h{t) := ^ |L<„|t" be (formal) generating 

functions for \L„\ and\L<n\ respectively. Then s{t) and b{t) are rational; ie. they 
agree as power series expansions with some ratio of integral polynomials in t. 

Proof. Note that b{t) = s{t) /{1 — t) so it suffices to prove the theorem for s{t). Let 
F parameterize L, and let AL be the adjacency matrix of F; i.e. Mij is equal to 
the number of edges from vertex i to vertex j. Let vq be the vector with a 1 in 
the initial state, and elsewhere, and let Va be the vector with a 1 in every accept 
state, and elsewhere. Then |L„| = v'^M'^Va- 

A formal power series A{t) := X^a^f" is rational if and only if its coefficients 
satisfy a linear recurrence; i.e. if there are constants co, • • • ,Cd. (not all zero) so that 
coan + cia„_i + - • ■ + Cdan-d = for all n> d. For, A{t){co + cit+- ■ ■ + cdt'^) vanishes 
in degree > d, and is therefore a polynomial (reversing this argument proves the 
converse). 

If p{t) = 'Y^Pit'''~'^ is the characteristic polynomial of M, then p{M) = 0, and 



Another way of expressing s{t), more useful in some ways, is as follows. 

Proposition 3.1.4. Let L be a regular language. Then there is an integer D so 
that for each value of n mod D either \Ln\ is eventually zero, or there are finitely 

many constants Aj and polynomials pi so that = }Ji(n)A" H + pk{n)X^ for 

all sufficiently large n. 

For a proof see e.g. [18] Thm. V.3. In particular, either |-L<„| has polynomial 
growth, or C~^{n''X^) < \L<n\ < C{n''X") for some real A and integer k, and 
constant C. 



= V^M^-''p{M)Va=Po\Ln\+Pl\Ln-l 



+ \-Pd\Ln-d\ 



proving the theorem. 



□ 
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3.2. Cannon's theorem. Let 5* be a set. A total order ^ on S* extends to a unique 
lexicographic (or dictionary) order on S* as follows: 

(1) the empty word precedes everything; 

(2) if u and v are both nonempty and start with different letters s,t £ S then 
u ~< V if and only if s -< and 

(3) if u -< V and w is arbitrary, then wu -< wv. 

If G is a group and S* is a generating set for G, there are finitely many geodesic words 
representing any given element; the lexicographically first geodesic is therefore a 
canonical representative for each element of g, and determines a language L c S* 
that bijects with G under evaluation. We denote evaluation by overline, so if m € 5* , 
we denote the corresponding element of G by u. We similarly denote length of an 
element of 5* by | • |. So we always have |u| < |u| with equality if and only if u is 
geodesic. 

Given g G G the cone type of g, denoted cone{g), is the set of h G G for which 
some geodesic from id to gh passes through g. For any n, the n-level of g is the 
set of h in the ball i?„(id) such that \gh\ < \g\. Cannon showed that the n level 
(for n sufficiently large) determines the cone type, and therefore that there are only 
finitely many cone types. 

Lemma 3.2.1 (Cannon [10] Lem. 7.1 p. 139). The 2S + 1 level of an element 
determines its cone type. 

Proof. Let g and h have the same 2S + 1 level, and let u, v be geodesies with u = g 
and V — h. Only id has an empty 25 + 1 level, so we may assume u, v both have 
length > 1. We prove the lemma by induction. Suppose uw, vw and uws are 
geodesies, where s G S. We must show that vws is a geodesic. Suppose to the 
contrary that there is some wTw2 = vTUs where |wi| = |u| — 1 and \w2\ < |w^| + 1. 
Then h~^iui is in the 26 + 1 level of h, which agrees with the 25 + 1 level of g, 
and therefore \gh~^Wi\ < \g\. But then concatenating a geodesic representative of 
gh~^Wi with W2 gives a shorter path to uws, certifying that uws is not geodesic, 
contrary to assumption. □ 




Figure 5. A shortcut W2 from the 26 + 1 level of h to vWs gives a 
shortcut from the 26+1 level of g to uws. This figure is adapted 
from [15]. 
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The following theorem is implicit in [10], though expressed there in somewhat 
different language. 

Theorem 3.2.2 (Cannon [10]). Let G be a hyperbolic group, and S a symmetric 
generating set. Fix a total order -< on S. Then the language of lexicographically 
first geodesies is prefix-closed and regular. 

Proof. That this language is prefix-closed is obvious. We show it is regular by 

describing an explicit parameterizing graph. 

As a warm up, we show first that the language of all geodesies is regular. A 
parameterizing graph can be taken as follows. The vertices (all accept states) are 
precisely the set of cone types, and there is an edge labeled s from a cone type of the 
form cone((?) to one of the form cone((?s) whenever \gs\ = \g\ + 1. By the definition 
of cone types, this is well-defined. By Lemma 3.2.1, the number of cone types is 
finite, so this is a finite graph. By construction, this graph exactly parameterizes 
the language of all geodesies. 

Now fix a total order -< on S. For each g G G, let Ug be the lexicographically 
first geodesic from id to g. For each g € G a competitor of g is some h with \h\ = \g\, 
with Uh -< Ug, and for which d{uh\<i,Ug\<i) < 26 for all i, where Ug\<i denotes the 
prefix of Ug of length i, and similarly for Uh\<i (this is described by saying that Uh 
synchronously fellow-travels Ug). 

If there is some g' with \g'\ = \g\ + d{g,g') and \g'\ = \h\ + d{h,g') then by 
(5-thinness and the definition of geodesies, Uh synchronously fellow-travels Ug. It 
follows that for all g G G and s € S we have Ugg = UgS if and only if UgS is a 
geodesic, and there is no competitor h of g and s' € S* so that hs' — gs. 

Given g £ G define C{g) C B2s{id) to be the set of h for which gh is a competitor 
of g. Associated to g is the list L{g) of pairs {h G C{g), conc{gh)) together with the 
cone type of g itself. Note that the set of possible lists L{g) is finite. We can now 
define a parameterizing graph by taking the vertices (all accept states) to be the 
possible lists L{g), and there is an edge labeled s from a list of the form L{g) to a 
list of the form L{gs) if and only if \gs\ — \g\ + l, and there is no /i € C{g) and s' G S 
with ghs' = gs. This is evidently a finite directed graph, which parameterizes the 
Ug-, we must show it is well-defined. 

First of all, h £ C (gs) if and only if one of the two following possibilities occurs: 

(1) there is some h' S C{g) and s' G S (1 cone{gh') with gh's' = gsh; or 

(2) there is some s' -< s in S (1 cone{g) with gs' = gsh. 

Both of these possibilities depend only on C{g), cone{g) or cone{gh') for some 
h' e C{g), and not on g itself. Second of all, if ft e C{gs), then cone{gsh) depends 
only on cone{gh') and s' in the first case, and on cone(5) and s' in the second case. 
This shows the graph is well-defined, and completes the proof of the theorem. □ 

Remark 3.2.3. This completes the proof of Lemma 2.5.1, and the subsequent results 
in § 2.5. The reader will note that the results in this section do not depend on 
Lemma 2.5.1, so our reasoning has not been circular. 

3.3. Combings and combable functions. 

Definition 3.3.1. Let G be a group, and S a generating set. A combing for G 

(with respect to S) is a prefix-closed regular language L C S* which bijccts with 
G under evaluation, and satisfies \u\ = \u\ for all u G L (i.e. L is a language of 
geodesies). 
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Theorem 3.2.2 says that every hyperbohe group admits a combing. If L is a 
combing with respect to S, the L-cone type of g, denoted coneL(5), is the set of 
ft G G for which the L-geodesic evaluating to gh contains a prefix (which is also an 
i-geodesic) evaluating to g. There is a graph T parameterizing L with one vertex 
for each L-cone type, and an edge from cone 1,(5) to coneL{gs) labeled s whenever 
s e conei,(g). 

Remark 3.3.2. The reader should be warned that many competing definitions of 
combing exist in the literatiire. 

Suppose L is a combing of G, and F is a graph parameterizing L, so that there 
is a (length-preserving) bijection between directed paths in F starting at the initial 
vertex, and words of L, by reading the edge labels of the path, liu G L, we let 7(m) 
denote the corresponding path in F, and j(u)i the successive vertices in F visited 
by 7(u). 

Definition 3.3.3. A function ^ : G ^ Z is weakly combable with respect to a 
combing L if there is a graph F parameterizing L and a function d(f) from the 
vertices of F to Z so that 0(m) = J2i d(t>{'^{u)i) for all u & L. 

A function (/) is combable if it is weakly combable with respect to some combing 

L, and if there is a constant C so that \(f>{gs) — (f>(g)\ < C for all g € G and s € S; 
and it is bicombable if it is combable, and further satisfies \(t){sg) — 4>{g)\ < C. 

Remark 3.3.4. It might be more natural to define a function d(p on the edges of F 
instead of its vertices; however, associated to any directed graph F there is another 

graph — the line graph of F — whose vertices are the edges of F, and whose edges 
are the composable pairs of edges of F, and the line graph of F parameterizes L if 
F does. 

Lemma 3.3.5 (Calegari-Fujiwara [8] Lem. 3.8). The property of being combable 
or bicombable does not depend on the choice of a generating set or a combing. 

The proof proceeds along the same lines as Theorem 3.2.2. The key point is 
that words in L are (uniformly) quasigeodesic with respect to S", and therefore 
stay within a bounded distance of words in L' with the same evaluation. Therefore 
an automaton reading the letters of an L'-word can keep track of the states of a 
collection of automata simultaneously reading nearby L-words, and keeping track 
of how (p changes as one goes along. See [8] for details. 

Example 3.3.6. Word length in any generating set is bicombable. In fact, if S is 
a (possibly unsymmetric) set which generates G as a semigroup, word length in S 
is bicombable. One can generalize word length by giving different generators (and 
corresponding edges in the Cayley graph) different lengths; providing the lengths 
are all integral and positive, the resulting (geodesic) word length is bicombable. 

Example 3.3.7. The sum or difference of two (bi)combable functions is (bi)combable. 

Example 3.3.8. The following definition is due to Epstein-Fujiwara [16] (also see 
[5]). Let (J be a path in Cs{G). A copy of cr is a translate ga for some g & G. Given 
a path 7 in 03 (G), define 0^(7) to be the maximal number of disjoint copies of a 
in 7, and ioi g G G define the small counting function Cq- : G — >■ Z by the formula 



Ca{g) = \g\ - inf(|7| - Ca(7)) 
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Counting functions are bicombable. In fact, we can add ct to 5* as a (semigroup) 
generator, but insist that the (directed) edges labeled a have length |cr| — 1 instead of 
1; this defines a new distance function | • \„ which is bicombable (by Example 3.3.6), 
and therefore so is the difference | • | — | • jg. = Co- (by Example 3.3.7). 

Many variations on this idea are possible; for instance, the "big" counting func- 
tions Cct which count all copies of a in 7, not just the maximal number of disjoint 
copies. 

3.4. Markov chains. A directed graph F is sometimes called a topological Markov 
chain. A topological Markov chain can be promoted to a genuine (stationary) 
Markov chain by assigning probabilities to each edge in such a way that the proba- 
bilities on the edges leaving each vertex sum to 1. Recall that we write the adjacency 
matrix as Af ; wc think of this as an cndomorphism of the vector space V spanned 
by the states of V . Let 1 denote the vector with all components equal to 1, and let 
L denote the vector corresponding to the initial state. 

Two states in a topological Markov chain are said to be com,m,unica,ting if there 
is a directed path from each to the other. The property of being comrmmicating 
is an equivalence relation. We write Ci — >■ C2 for equivalence classes Ci if there is 
a directed path from some (any) vertex of Ci to some (any) vertex of C2; observe 
that — > is a partial order. We call each equivalence class a component. 

The induced (directed) subgraph associated to a component C is itself a topo- 
logical Markov chain. Its adjacency matrix Mc has the property that for any i and 
j there is an n (in fact, infinitely many n) so that {MQ)ij is positive; one says such 
a Markov chain is irreducible. If there is a fixed n so that {M^)ij is positive for all 
i, j we say the Markov chain is aperiodic; this holds exactly when the gcd of the 
lengths of all loops in C is 1. A Markov chain (on a finite state space) which is 
both irreducible and aperiodic is ergodic. 

Lemma 3.4.1 (Perron-Frobenius). Let M be a real matrix with positive entries. 

Then there is a unique eigenvalue A of biggest absolute value, and this eigenvalue 
is real and positive. Moreover, X is a simple root of the characteristic polynomial, 
and it has a right (left) eigenvector with all components positive, unique up to 
scale. Finally, any other non-negative right (left) eigenvector is a multiple of the A 

eigenvector. 

Proof. Since the entries of M are positive, M takes the positive orthant strictly 
inside itself. The projectivization of the positive orthant is a simplex, and therefore 
Af takes this simplex strictly into its interior. It follows that Af has a unique 
attracting fixed point in the interior this simplex; this fixed point corresponds to 
the unique eigenvector v (up to scale) with non-negative entries, and its entries are 
evidently all positive, and its associated eigenvalue A is real and positive. 

If TT is any plane containing this unique positive eigenvector, the projectivization 
of TT is an MP^; since the eigenvector becomes an attracting fixed point in this MP^, 
it is not the only fixed point. This shows that A is a simple eigenvalue; a similar 
argument shows that —A is not an eigenvalue. 

Let /i be any other eigenvalue. If jj, is real, then < A. Suppose /U is complex, 
acting as composition of a dilation with a rotation on some plane tt. If = A then 
the restriction of M to tt® (ii) acts projectively like a rotation; but this contradicts 
the fact that u is a projective attracting fixed point. This proves the theorem. □ 
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If M is non-negative, there is still a non-negative real eigenvector v with a 
real positive eigenvalue A, and every other eigenvalue fi satisfies |^| < A. In this 
generality, A might have multiplicity > 1, and the Jordan block associated to A 
might not be diagonal. However if M is irreducible, then A has multiplicity 1, the 
eigenvector v is strictly positive, and every other eigenvalue with absolute value A 
is simple and of the form ^'^^I^X. These facts can be proved similarly to the proof 
of Lemma 3.4.1 

Now let G be a hyperbolic group, L a combing with respect to some generat- 
ing set, and F a graph parameterizing L. Let Vc be the quotient directed graph 
whose vertices are the components of F. Note that Vc contains no directed loops. 
Associated to each vertex of Vq is an adjacency matrix Mc which has a unique 
maximal real eigenvalue A(C) of multiplicity 1. We let A = maxc A(C), and we call 
a component maximal if A(C) — A. 

The next lemma is crucial to what follows, and depends on Coornaert's estimate 
of the growth function (i.e. Corollary 2.5.9). 

Lemma 3.4.2. The maximal components do not occur in parallel; that is, there is 
no directed path from any maximal component to a distinct maximal component. 

Proof. Since there is a directed path from the initial vertex to every other vertex, 
the number of paths of length n is of the form p(n)A" -|- 0(<z(n)^") for polynomials 
p, q and ^ < A, where A is as above. Moreover, the degree oi p is one less than the 
length of the biggest sequence of maximal components Co — )• Ci • • • — )• C^eg{p) ■ 
The number of paths of length n is equal to the number of elements of G of length 
n, so Corollary 2.5.9 implies that the degree of p is zero. □ 

It follows that all but exponentially few paths 7 of length n in 7 are entirely 
contained in one of the maximal components of F, except for a prefix and a suffix 
of length 0(log(n)). Consequently, the properties of a "typical" path in F can 
be inferred from the properties of a "typical" path conditioned to lie in a single 
component. 

For any vector v, the limits 



exist, and are the projections onto the left and right A-cigcnspaces respectively. 
Heuristically, £{v) is the distribution of endpoints of long paths that start with 
distribution v, and p{v) is the distribution of starting points of long paths that end 
with distribution v. 

Recall that t denotes the vector with a 1 in the coordinate corresponding to the 
initial vertex and Os elsewhere, and 1 denotes the vector with all coordinates equal 
to 1. Define a measure /i' on the vertices of T by = £{L)ip{l)i, and scale /i' to a 
probability measure p. Define a matrix N by Nij = Mijp{l)j/ Xp{l)i if ^ 0, 
and define Nij = Sij otherwise. 

Lemma 3.4.3. The matrix N is a stochastic matrix (i.e. it is non-negative, and 
the rows sum to 1) and preserves the measure p,. 



n-1 



n-1 



p{v) := lim n 



-1 
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Proof. If p(l)i — then Nij = 1 by fiat. Otherwise 

V-^r _ Mi,p{l)j _ {Mp{l))i _ 

Ap(l)i Ap(l)i 

. To see that N preserves /i' (and therefore /i), we calculate 

□ 

In words, yitj is the probability that a point on a path will be in state i, conditioned 
on having originated at the initial vertex in the distant past, and conditioned on 
having a distant future. 

3.5. Shift space. For each n let F„ denote the set of paths in F of length n starting 
at the initial vertex, and let X„ denote the set of all paths in F of length n. We 
can naturally identify with the vertices of F. 

Restricting to an initial subpath defines an inverse system •••—>• X„ —>••••—>■ 
X\ — >■ Xq, and the inverse limit X^o is the space of (right) infinite paths. Similarly 
define C X^o- If we give each Xn and Yn the discrete topology, then Xoo and 
Foo are Cantor sets. 

If X := Xo,Xi,--- and x' := Xq, x'^ • • • are two elements of Xoo, we define {x\x') 
to be the first index at which x and x' differ, and define a metric on X^o by setting 
d{x,x') = a~^^l^ ' for some a > 1 (the notation (-j-) is deliberately intended to 
suggest a resemblance to the Gromov product). If we like, we can define X = 
UiXi U Xoo and metrize it (as a compact space, in which each X„ sits as a discrete 
subset) in the same way. Similarly, give Y^o the induced metric, and define Y = 
UiYi U Yao likewise. 

The shift operator T : X^o — > -'^oo is defined by {Tx)i = Xi+i. We define a 
probability measure /x on each X„ by iJ,{xo---Xn) = Ma:o-^xoxi-^ccicc2 ' ' ' -^ccn-ix„ 
where /i and N are the measure and stochastic matrix whose properties are given 
in Lemma 3.4.3. By the definition of an inverse limit, there is a map X^o ^ Xn for 
each n which takes an infinite path to its initial subpath of length n; the preimages 
of subsets of the X„ under such maps are a basis for the topology on X^o, called 
cylinder sets. The measures fi as above let us define a Borel probability measure 
fj, on Xoo by first defining it on cylinder sets (note that the definitions of fj, on 
different X„ are compatible) and extending it to all Borcl sets in the standard 
way; Lemma 3.4.3 implies that /x is T-invariant (i.e. p{A) — fi{T^^{Aj) for all 
measurable A c ^oo)- 

There is a bijection between Yn and I/„, and by evaluation with G„. This map 
extends continuously to a map E : Y ^ G, hy sending Y^o — >■ dooG. 

Lemma 3.5.1. The map E : Y ^ G is surjective, Lipschitz in the a-metric, and 
bounded-to-one. 

Proof. That the map is Lipschitz follows immediately from the definition, and the 

observation that {E{y)\E{y')) < {y\y') — 5 for y,y' G Y. The restrictions : F„ — > 
Gn are all bijections, so we just need to check that Foo — > i9ooG is surjective and 
bounded-to-one. 
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Since E is continuous, Y is compact and G is HausdorfF, the image is compact. 
Since the image is dense (because it contains C?„ for all n), it is surjective. 

Finally, observe that if y and y' are any two points in Fqoi and 7,7' are the 
associated infinite geodesies in G, then 7 n 7' is a compact initial segment, since 
after they diverge they never meet again (by the definition of a combing). Fix 
X € 9ooG, let yi be a finite subset of E~^{x), and let 7, be the geodesic rays in 
G corresponding to the t/j. For all but finitely many points p on any ji, each 7^ 
intersects the ball Bs{p) dis jointly from the others. In particular, the number of 
points in the preimage of any point in dooG is bounded by the cardinality of a ball 
(in G) of radius S. □ 

Recall that in § 2.5 we defined probability measures Vg on G for each s > h{G). 
Note that e'''-'^-' = A where A is as above. For each n we define a probability measure 
on Yn (which, by abuse of notation, we call I's) by I'siu) = i^s{E{y)conei,{E{y))) for 
y G Ym and observe that the limit as s — >■ h{G) from above (which we denote y{y)) 
exists and depends only on the cone type conei(£^(y)). Since the Patterson-Sullivan 
measure v is supported in (?ooG, the measures v on each Y^ are compatible, thinking 
of each as a collection of cylinder sets in Y^, and define a unique probability 
measure u on Y^ which pushes forward under E to v on dooG. 

Lemma 3.5.2. The measure fx on X^o is the limit fi = lim„_>.oo ^ J2^^o Tlu. 

Proof. We give the sketch of a proof. For any y, let be the (regular) language 
of suflttxcs of words in L with y as a prefix, and let be the subset of of length 
n. Then Vs{y) = Cg'(«) E„ 

If there is no path from the final state ?/„ to a maximal component, the growth 
rate of is strictly less than that of L, and Vs{y) 0. Otherwise both growth 
functions are eventually of the form CA" plus something exponentially small com- 
pared to A". Define measures on Yn by Vm{y) = ^ Y^TLi A"^'^'"*"*) |L^|. Then 
by considering the form of the growth functions of L and L^, we see that there is a 
constant C (not depending on y or n) so that \\mm-ioo i^m{y) = Cv{y). Scahng 
to be a probability measure;, we can set C = 1. 

The proof now follows from the definition of ^,■,N•, see [8] Lem. 4.19 for details. 

□ 

3.6. Limit theorems. Let ^1,^2,- •• be a (stationary) irreducible Markov chain on 
a finite state; space, with stationary measure jjL, and let / be a real- valued function 
on the state space (since this space is finite, there are no additional assumptions 
on /; in general we require / to be integrable, and have finite variance). Define 
Fn:= Tri=imi\^^^A = J fdiJi. 

Theorem 3.6.1 (Markov's central limit theorem). With notation as above, there 
is some a >Q so that for any r < s, 

r<ElL^<s)=^ fe-^'/^dx 

Equivalently, there is convergence in probability n~^/^(F„ — nA) — )• A''(0,cr) 
where A^(0, a) denote the normal distribution with mean and standard deviation 
(7 (in case cr = 0, we let A^(0, a) denote a Dirac mass centered at 0). 

Now, each maximal component C as above is a stationary irreducible Markov 
chain, with stationary measure the conditional measure fjb\G. The measure on 



Hm P 
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decomposes measurably into the union of (shift- invariant) subspaces Xoo{C), the 
subspace of (right) infinite sequences contained in the component C. Consequently, 
if ^ is a combable function on G, then for each maximal component C, there are 
constants Ac = J^d(j)/iJ,{C)diJ, and ac, so that for n-a.e. x G Xoo(C), the random 

variable n~^/'^{J2"^o '^^i^i) ~ nAc) converges in probability to N{0,ac)- 

By Lemma 3.5.2, for i/-a..e. y € there is a unique C so that T"y G X^{C) for 
sufficiently big n; we say that y is associated to the component C. Let Y^{C) be 
the set of y associated to a fixed C. For v-a-.e. y G Y^oiC) we have convergence in 
probability n^^^iY.'^'o d(f>{y^) - nAc) ^ iV(0, ac) (one way to sec this is to observe 
that this is a shift-invariant tail property of y, and use Lemma 3.5.2). 

For combable functions, this is the end of the story. It is certainly possible for 
the constants Ac,o-c to vary from component to component. But for bicombable 
(j) we have the following key lemma: 

Lemma 3.6.2. Let </> be bicombable. Then there are constants A, a so that Ac = A 
and Gc = <y for all maximal components C. 

Proof. Call y G Y^o typical if there are constants Ay and ay (necessarily unique) 
so that n^^^{J27=o '^4>{yi) — nAy) N{Q, ay). For each C we have seen that v-B,.e. 
y G Yoo{C') is typical with Ay = Ac and ay = ac- 

The map E : Y^o — > dooG is finite-to-onc, and takes the measure ly on 1^00 to the 
Patterson-Sullivan measure on dooG. Hence E{Yoo{C)) has positive measure for 
each C. Let y G dcoG be typical, and let id, gi, 172, • • • be the associated geodesic 
sequence of elements in G converging to E{y). Now let g be arbitrary, let y' be any 
element of Foo with E{y') = gE{y), and let id, g[, g'2, ■ ' ' be the geodesic sequence of 
elements in G associated to y' . By (5-thinness, d{g[, ggi) is eventually approximately 
constant, and therefore bounded. Since (j) is bicombable, y' is typical, with Ayi = Ay 
and ay' = ay. But the action of G on is ergodic for z/. by Corollary 2.5.10, and 
therefore for any C,C' there are typical y G i^oo(C), y' G Yoo{C') with Ay = 
Ac, ay = ac and Ay> = Ac, ay' = ac, and with y' = gy for some g. This 
completes the proof. □ 

Corollary 3.6.3 (Calegari-Fujiwara [8]). Let G be hyperbolic, and let 4> be bicom- 
bable. Then there are constants A, a so that if gn denotes a random element of Gn 
(in the v measure), there is convergence in probability n~^^^{(p{gn)—nA) — )■ A^(0, a). 

Note that A and a as above are algebraic, and one can estimate from above the 
degree of the field extension in which they lie from the complexity of T. 

The uniform measure and the measure ;/ on G„ arc uniformly quasi-equivalent 
on a large scale, in the sense that there are constants R and G so that for any 
g G Gn, there is an inequality 

C-^\Bn{g) n G„|/|G„| < iy{Bn{g) n G„) < G\BR{g) D G„|/|G„| 

It follows that if gn denotes a random element of G„ (in the uniform measure), the 
distribution n^^/^(0(5„) — nA) has a tail that decays hkc Gie^'-^^* . 

Since length with respect to one generating set is bicombable with respect to 

another, we obtain the following corollary: 

Corollary 3.6.4. Let G be hyperbolic, and let S and S" be two finite generating 
sets for G. There is an algebraic number Xs,s' so that if gn is a random element 
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of S of word length n, then the distribution n ^^■^(|<?n|s' — nXs,s') has a tail that 
decays like Cie"*^^* when n is sufficiently large. 

It is a slightly subtle point that Xs',s > -^s S'' ^^'^ inequality is strict except 

for essentially trivial cases. 

3.7. Thermodynamic formalism. To push these techniques further, we must 
study classes of functions more general than combable functions, and invoke more 
sophisticated limit theorems. There is a well-known framework to carry out such 
analysis, pioneered by Ruelle, Sinai, Bowen, Ratner, Parry etc.; [36] is a standard 
reference. 

The setup is as follows. For simplicity, let M be a fcx A; matrix with 0-1 entries for 
which there is a constant n so that all the entries of M" are positive (i.e. M is the 
adjacency matrix of a topological Markov chain with k states which is irreducible 
and aperiodic). Let X^o be the space of (right) infinite sequences x := xq^xi, ■ ■ ■ 
satisfying M(x„, a;„_|-i) = 1 for all n, and let T be the shift operator on X^o- As 
before, we can metrize Xoo by d(x,x') = a^^^'^ ■* for some fixed a > 1, and observe 
that the action of T on Xqo is mixing. This means that for all nonempty open sets 
U,V d Xoo there is N so that T-"'{U) n F is nonempty for all n> N. Note that 
if M is irreducible but not aperiodic, there is nevertheless a decomposition of X^ 
into D disjoint components which are cycled by T, and such that is mixing on 
each component, where D is the gcd of the periods of T-invariant sequences. 

Let Mt be the space of T-invariant probability measures on Xoo- This is a 
convex, compact subset of the space of all measures in the weak-* topology. It is 
not hard to show that the topological entropy h of T is equal to the supremiim of 
the measure theoretic entropies sup^^jy^^ h{fj,), and that h = log A where A is the 
Perron-Frobenius eigenvalue of M; see e.g. [36]. 

The shift T uniformly expands Xqo by a factor of a, and therefore if a function 
on Xoo is sufficiently regular, it tends to be smoothed out by T. Define T*f by 
T* f{x) = f o Tx. We would like the iterates {T^-y f to have a uniform modulus 
of continuity; this is achieved precisely by insisting that / be Holder continuous, 
that is, that there is some a so that \f{x) - f{x')\ < Cd{x,x'Y = Ca-"(^l^'). 
The set of functions / on X^o, Holder continuous of exponent a, is a Banach space 
with respect to the norm ||/||oo + I1/I!q where ||/||q is the least such C so that 
\f{x) - f{x')\ < Cdix,x')". We denote this Banach space C"(Xoo). 

Definition 3.7.1. Let / be Holder continuous on X^o- The pressure of /, denoted 
P(/), is P{f) = snp^^^^ihifi) + Jfdfj,). 

It turns out that the suprc^mum is realized on some invariant measure ^/ of full 
support, known as the equilibrium state (or Gibbs state) of /. That is, P{f) = 
h{nf) + J fdfif. See e.g. [4] Ch. 1 for a proof of this theorem, and of Theorem 3.7.3 
below. 

Definition 3.7.2. The Ruelle transfer operator Lf associated to / is defined by 
the formula Lfg(x) = YIiTx'^x ^^^^ ^9{^')- Note that Lf acts as a bounded linear 
operator on C"(Xoo). 

Theorem 3.7.3 (Ruelle-Perron-Probenius [36]). The operator Lf has a simple 

positive eigenvalue e^^^^ which is strictly maximal in modulus. The essential spec- 
trum is contained in a ball whose radius is strictly less than e^^^\ and the rest of 
the spectrum outside this ball is discrete and consists of genuine eigenvalues. 
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There is a strictly positive eigenfunction -0/ satisfying Lftpf ~ e^^-^^ipf, and an 
"eigen probability measure" Vf satisfying L'^ff = e^^^^Uf, and if we scale tpf so 
that J i^fduf = 1, then the equilibrium state /if is equal to I'fil'f- 

Remark 3.7.4. Vf can be thought of as a left eigenvector for Lf, and tpj as a right 
eigenvector. When / is identically zero, Lf is basically just the matrix M, and iJ.f 
is basically just fj, as constructed in § 3.4. 

PoUicott [34] proved a complexified version of the RPF theorem, and showed 
that P{f) and ipf are analytic on an open subset of the complex Banach space 
C"(Xoo, C) which contains a neighborhood of C°'{X^,R) (i.e. of C"(Xoo)). 

Because of the simplicity and analyticity of the maximal eigenvector /value, one 
can study the derivatives of pressure. For simplicity, let P{t) := P{tf + g). Then 
we can compute 

P'(0) = j fd/Ig 

and a further differentiation gives 

P"(0)= J f^ + 2fw'{0)dfig 

where w{t) = iptf+g (suitably normalized). 

Now, let Fn{x) = X]"=o /(^'^)- Then from the definition of the transfer opera- 
tor, L"j^g(-) = Lg(e*^"-), and therefore one obtains 

nP"(0)= J p2 + 2P„w;'(0)dM9 

If we set g to be identically zero, then fig is just the equilibrium measure /i from 
before. If we change / by a constant f — J fdfj, to have mean 0, then the ergodic 
theorem shows (l/n)P„ /x-a.e. and therefore 

P"(0) = lim / -F^dn 

n^oc J n 

It is usual to denote this limiting quantity by a^. 

The analyticity of P lets us control the higher moments of P„ in a uniform 

manner, and therefore by applying Fourier transform, one obtains a central limit 
theorem n~^/-^P„ — >• N{0,a). Better estimates of the rate of convergence can be 
obtained by studying P"'(0); see [11]. 

This theorem can be combined with Lemma 3.6.2 to obtain a central limit the- 
orem for certain functions on hyperbolic groups whose (discrete) derivatives along 
a combing satisfy a suitable Holder continuity property. Such functions arise nat- 
urally for groups acting cocompactly on CAT{K) spaces with K < 0, where one 
wants to compare the intrinsic geometry of the space with the "coarse" geometry 
of the group. 

Let Z be a complete CAT{K) geodesic metric space with K <0, and let G act 

cocompactly on Z by isometrics. Pick a basepoint z Z, and define a function 
P on G by F{g) = d{z,gz). Since G is hyperbolic, if we fix a finite generating 
set S we can choose a geodesic combing L with respect to S as above. Now, for 
any s € S define DsF(g) = F{g) — F{sg). It is straightforward to see from the 
CAT{K) property that there are constants C and a (depending on K and G) so 
that \D,F{g) - D,F{h)\ < Ga-"(sl^) for all s and all g,h€G. 
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An element of UX„ corresponds to a path in F. Reading the edge labels deter- 
mines a word in the generators (a suffix of some word in L), and by evaluation, an 
element of G. Let E : UX„ — >■ G denote this evaluation map (note that this is not 
injective). We can define a function DF on UX„ by DF{x) = DsF{E{x)) where 
is the label associated to the transition from xq to xi (we could suggestively 
write s = x^^x^). Evidently, DF extends to a Holder continuous function on X. 
Furthermore, for each y e F„, we have Y^^Zq DF{T^y) = F{E{y)). 

For each big component C, it follows that u-a.e. y G Voo(C) are Accrc typical 
(for the function DF) for some Ac,(tc depending only on C. Since F is Lipschitz 
on G in the left and right invariant metrics, the argument of Lemma 3.6.2 implies 
that Ac,(Jc are equal to some common values A,(j, and therefore we obtain the 
following corollary: 

Corollary 3.7.5. Let Z he a complete CAT(i^) geodesic metric space with K < Q, 
and let G act cocompactly on Z by isometrics. Pick a basepoint z G Z , and a 
finite generating set S for G. Then there are constants A and a so that if gn is 
a random, element of G„ (in the v measure), there is convergence in probability 

n-^l^{d{z,gnz) - An) iV(0,CT). 

Evidently, the only properties of the function F we use are that it is Lipschitz in 
both the left- and right-invariant metrics, and satisfies a Holder estimate \DsF{g) — 
DsF(h)\ < Ca-"-'^s\h) fQj. ^ heG. Any such function on a hyperbolic 

group satisfies a central limit theorem analogous to Corollary 3.7.5. For the sake 
of completeness, therefore, we state this as a theorem: 

Theorem 3.7.6 (Holder central limit theorem). Let G he a hyperbolic group, and 
S 0, finite generating set for G. Let F he a real-valued, function which is Lipschitz 
in both the left- and right- invariant word metrics on G, and satisfies \DgF{g) — 
DsF{h)\ < Ca^"*^^!''' for all s in S and all g,h £ G. Then there are constants 
A and a so that if (/„ is a random element of G„ (in the v measure), there is 
convergence in probability n~^^'^{F{gn) — An) — > A^(0, a). 

Remark 3.7.7. The idea of using the thermodynamic formalism to study the rela- 
tionship between distance and word length in cocompact groups of isometrics of 
hyperbolic space is due to PoUicott-Sharp [35]; Corollary 3.7.5 and Theorem 3.7.6 
above are simply the result of combining their work with [12] and [8]. Nevertheless, 
we believe they are new. 

4. Random walks 

The main references for this section are Kaimanovich [22] and Kaimanovich- 
Vershik [23]. The theory of random walks is a vast and deep subject, with connec- 
tions to many different parts of mathematics. Therefore it is necessary at a few 
points to appeal to some standard (but deep) results in probability theory, whose 
proof lies outside the scope of this survey. A basic reference for probability theory 
is [41]. We give more specialized references in the text where relevant. 

This section is brief compared to the earlier sections, and is not meant to be 
comprehensive. 

4.1. Random walk. Let G be a group and let /i be a probability measure on G. 
We further assume that jj is nondegenerate; i.e. that the support of /x generates 
G as a semigroup. An important example is the case where n is the uniform 
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measure on a symmetric finite generating set S. There are two ways to describe 
random walk on G determined by /x: as a sequence of elements visited in the 
walk, or as a sequence of increments. In the first description, a random walk 
y := id, yi, 2/2, • ' ' is a Markov chain with state space G, with initial state id, and 
with transition probability Pgh = iJ,{g~^h). In the second description, a random 
walk z := zi, Z2, - ' ' is a sequence of random elements of G (the increments of the 
walk), independently distributed according to /U. The two descriptions are related 
by taking Un = Z1Z2 ■ ■ ■ Zn- We write this suggestively as z = Dy and y = Hz. 

We use the notation (G^,//^) for the product probability space, and (G^,P) 
for the probability space of infinite sequences with the measure P on cylinder sets 
defined by 

P{{y : y begins id,yi,-- - ,y„}) = PidyiPyiy2 ■ ■ ■ Pyn-ivn 

With this notation, 2; is a random element of {G^,n^) and y is a random element 
of (GN,P). 

The shift operator T acts on G^ by {Tz)n = or {Ty)^ = yn+i- It is 

measure preserving for /U^ but not for P; in fact, from the definition, the support 
of P is contained in the set of sequences starting at id. The action of the shift T on 
(G^, /Lt^) is ergodic. For, if A is a subset satisfying A = T~^{A), then a sequence z 
is in A if and only if T'^{z) is in A for sufficiently big n. This is a tail event for the 
sequence of independent random variables Zi, so by Kolmogorov's 0-1 law (see [41] 
Thm. 1.1.2) A has measure or 1. 

Definition 4.1.1. Let G be a group and S a finite generating set. Let be a 
probability measure on G. The first moment of /x is ^ |g|/Lt(g); if this is finite, we 
say n has finite first moment. 

Lemma 4.1.2. Let ^ be a probability measure on G with finite first moment. Let 
id,yi, j/2, • • • be a random walk determined by fx. Then L := lim„^oo \yn\/n exists 
almost surely, and is independent of y. In fact, if /k*" denotes n-fold convolution 
(i.e. the distribution of the random variable yn), then L = lim„_).oo Yl lfi'lM*"(fl')- 

Proof. We set z = Dy. Define hn{z) := Then /i„ satisfies 

K+miz) < hn{r " Z) + hra{z) 

i.e. hn form a subadditive cocycle. Kingman's subadditive ergodic theorem (see e.g. 
[40]) says that for any subadditive cocycle /i„ on a space with a T-invariant 
measure, the limit limn->.oo hn{z) / n exists a.s. and is T-invariant. In our circum- 
stance, finite first moment implies that hi (and all the /i„) are in L^, so the theorem 
applies. Since the action of T on {G^, fi^) is ergodic, the limit is independent of z. 
The lemma follows. □ 

L as above is called the drift of the random walk associated to fi. Since each 

> we necessarily have L > 0. 

Example 4.1.3. If G = Z" and /i is symmetric (i.e. iJ,{g) = ix{g~^) for all g) with 

finite support, then L = 0. 

We now focus our attention on the case of hyperbolic groups and simple random 
walk (i.e. when fj, is the uniform measure on a finite symmetric generating set). 
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Lemma 4.1.4. let G be a nonelementary hyperbolic group, and let n be a nonde- 
generate probability measure on G with finite first moment. Then the drift L of 
random walk with respect to fj, is positive. 

Proof. We give the idea of a proof. Let /i*" denote the n-fold convolution of /i as 
before. The probabihty measures /i*" have a subsequence converging to a weak 
Umit in G. Clearly the support of is contained in dooG (a group for which 
linisup„^QQ > for any g and for /i nondegenerate is a finite group). 

To prove the lemma it suffices to show that for any C, for sufficiently large enough 
n there is an inequality h(\^9\ ~ l'''l)/"*"(fl')M*^('i) > C > for all A'' > n, since 
then L > C/n. Now, for each h, if g satisfies \hg\ — \h\ < C, then the closest point 
on the geodesic from to g is within 6 of some geodesic from id to h^^. So as \g\ 
goes to infinity, the a-distance from to g goes to 0. Hence for this inequality 
to fail to hold, almost half of the mass of /i*" x /x*^ must be concentrated near the 
antidiagonal; i.e. the set of {g,g~^) C G x G. 

From this we can deduce that either the desired inequality is satisfied, or else 
most of the mass of /i*" must be concentrated near a single geodesic through id. 
Taking n — >• oo, the support of must consist of exactly two points, and G is 
seen to be elementary, contrary to hypothesis. □ 

Remark 4.1.5. It is a theorem of Guivarc'h (see [46], Thm. 8.14) that if G is any 
group with a nondegenerate measure n (always with finite first moment) for which 
the drift of random walk is zero, then G is amenable. Some care is required to 
parse this statement: on an amenable group some nondegenerate measures may 
have positive drift, but on a nonamenable group, every nondegenerate measure has 
positive drift. 

A nonelementary hyperbolic group always contains many nonabelian free groups, 
and is therefore nonamenable; this gives a more highbrow proof of Lemma 4.1.4. 

Lemma 4.1.6 (Kaimanovich [22] 7.2). Let X be a S -hyperbolic space. The following 
two conditions are equivalent for a sequence Xn in X and a number L > 0: 

(1) d{xn,Xn+i) < o(n) and d{xo,Xn) = nL + o{n); 

(2) there is a geodesic ray 7 so that d{xn,^{Ln)) = o{n). 

A sequence Xn satisfying either condition is said to be regular. 

Proof. That (2) implies (1) is obvious, so we show that (1) imphes (2). For sim- 
plicity, we use the notation \y\ := d{xo,y). The path obtained by concatenating 
geodesies from Xn to Xn+i has finite a-length, and therefore converges to some 
unique x^o d^oX. 

Let 7„ (resp. 700) be geodesic rays from the origin to x„ (resp. Xoo) and 
parameterize them by distance from the origin. Fix some positive e, and let TV = 
iV(e) be such that for any two n,m > N the geodesies 7„ and 7^ are within i5 on 
the interval of length {L — e)n, and let p„ = 7„((i — e)n) so that 7^) < 5 for 
n> N. Now, d{pn^i,pn) < L — 6 + 4(5 and therefore c?(p„,pm) < \n — m\{L — e + 4:S). 
On the other hand, d{pn,Pm) > \\Pm\ — \Pn\\ = \n — m\{L — e). Consequently the 
sequence pi is a quasigeodesic, and therefore there is a constant H = H{6, L) so that 
d{pn,PNXoo) < H for any n > N. Since pNXoo and 70c are asymptotic, d{pn, 700) < 
if + (5 for sufficiently large n, and therefore d{xn,^ao) < H + 6 + {\xn\ — n{L — e)) 
for sufficiently large n. Taking e — )• proves the lemma with 7 = 700. □ 
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Together with Lemma 4.1.4 this gives the foUowing CoroUary: 

Corollary 4.1.7 (Kaimanovich [22] 7.3). Let G be a nonelementary hyperbolic 
group, and let n he a nondegenerate probability measure on G with finite first mo- 
ment. Then there is L > so that for a. e. random walk y there is a unique geodesic 

ray with d{yn,jy{Ln)) = o{n). 

Proof. It suffices to show that if /U has finite first moment, then d{yn, yn+i) = o(n) 
almost surely. Let z = Dy, and for any e > let En be the event that |2;n| > en. 
Then the probability of En is J2\g\>en M(fl')) and therefore 

n n \g\>en 

Therefore by the easy direction of the Borcl CanteUi lemma (see e.g. [41] 1.1.4) 
the probability that En occurs infinitely often is zero. Since this is true for every 
e, we have d{yn,yn+i) = \zn\ = o(n) almost surely. □ 

4.2. Poisson boundary. Define an equivalence relation ~ on by y ~ y' if and 
only if there are integers k, k' so that T'^y = T'^ y'. 

Definition 4.2.1. The measurable envelope of ~ is the smallest measurable equiv- 
alence relation generated by ^. The quotient measure space (F, z/) of (G^,P) by 
the measurable envelope is called the Poisson boundary of G with respect to /i. 

In other words, i/- measurable functions on T correspond precisely to T-invariant 
P-measurable functions on G^. We let bnd : G^ ^ F be the quotient map, so that 
bnd P = z/. 

Now, G acts on G^ on the left coordinatewise. This action commutes with T, 
and descends to an action on F. Since ^ is T-invariant, bnd P = bnd TP, so 
v = J2g lJ'{9)9^'i i-e- the measure u is ^,- stationary. 

Definition 4.2.2. A ii-houndary is a G-space with a /i-stationary measure A which 

is obtained as a T-equivariant (measurable) quotient of (G^,P). 

Any /i-boundary factors through (F. u). A //-boundary is n-maximal if the map 
from (F, v) is a measurable isomorphism. Kaimanovich [22] gave two very useful 
criteria for a /U-boundary to be maximal. 

Theorem 4.2.3 (Kaimanovich ray criterion [22] Thm. 5.5). Let B be a fx-boundary, 

and for y G G^ let Tl{y) G B be the image of y under the (G-equivariant) quotient 
map n : G^ B. If there is a family of measurable maps TTn ■ B G such that 
P-a.e. d(y„,7r„(n(y))) = o(n) then B is maximal. 

Together with Corollary 4.1.7, this gives the following important result: 

Corollary 4.2.4 (Kaimanovich [22] Thm. 7.6). Let G be a nonelementary hyper- 
bolic group, and let ii be a nondegenerate probability m,easure of finite first moment. 
Let n : G^ — >■ d^oG take a random walk to its endpoint (which exists P-a.e.), and 
let A = nP. Then {dooG, A) is the Poisson boundary of G,^,. 

Proof. Simply define 7r„ to be the maps that take a point y G dooG to jy{nL) where 
7y is a parameterized geodesic ray from id to j/, and L is the drift. □ 
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4.3. Harmonic functions. 

Definition 4.3.1. If / is a function on G, the operator (convolution with /i) 
is defined by P^f{g) '■— ^ function / on G is ^-harmonic (or 

just harmonic if /i is understood) if it is fixed by P^; i.e. if it satisfies f{g) = 
J2h fi9h)f^ih) for all g in G. 

In general we need to impose some condition on / for f{gh)fi(h) to be defined. 
If the support of fi is finite, then / can be arbitrary, but if the support of fi is infinite, 
we usually (but not always!) require / to be in We let H°°(G,ii) denote the 
Banach space of bounded /i-harmonic functions on G. 

In probabilistic terms, if / is harmonic and y € G^ is a random walk, the 
random variables /„ := /(?/«) arc a mMrtingale; i.e. the expected value of /„ given 
is fn-i (see e.g. [41] § 5.2 for an introduction to martingales). There is an 
intimate relation between harmonic functions and Poisson boundaries, expressed in 
the following proposition. 

Lemma 4.3.2. The Banach spaces H°°{G,fjb) and L°°{V,v) are isometric. 

Proof. Given / € H°°{G, fj,) and y G G^, the random variables /(y„) are a bounded 
martingale, and therefore by the martingale convergence theorem ([41] Thm. 5.2.22), 
converge a.s. to a well-defined limit. Evidently this limit is measurable and T- 
invariant, and therefore descends to a function on F which we denote /. Explicitly, 
/(bnd y) := lim„_>oo /(^n)- 

Conversely, given / e L°°{r,i') we define f{g) = /p/rf(3*i^) (this expression is 
known as the Poisson formula). Since is stationary, / is harmonic. 

The mean value property of harmonic functions implies that these maps are 
isometries, since a harmonic function achieves its maximum on the boundary. □ 

Note that the Poisson formula is available for any ^-boundary. That is, if B 
is a G-space with a /i-stationary probability measure A, and / is any element of 
L°°{B,X), then f{g) := Jg fd{g\) is a bounded harmonic function on G. If A is 
not invariant, / is typically nonconstant. 

The remainder of this section is devoted to some miscellaneous applications of 
random walks to hyperbolic and other groups. 

4.4. Green metric. There is a close resemblance between the measure v and the 
Patterson-Sullivan measures constructed in § 2.5. This resemblance can be sharp- 
ened if one looks at a natural metric on G adapted to the random walk, namely 
the so-called Green metric. 

Definition AAA. Let G be a group and /i a probability measure on G with finite 
first moment. The Green metric on G is the metric for which the distance between 
g and /i is — log of the probability that random walk starting at g ever hits h. 

If /U is symmetric, so is the Green metric, since random walks are time-reversible. 

Note that the Green metric is degenerate if random walk is recurrent. For simple 
random walk, this occurs only if G is finite, or is virtually Z or Z^, by a classical 
result of Varopoulos (see [45]). For nondegenerate measures with finite first moment 
on non-elementary hyperbolic groups, Blachere and Brofferio [1] show that the 
Green metric and the word metric are quasi-isometric (one needs to be somewhat 
careful: the Green metric is not in general a geodesic metric). 
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Theorem 4.4.2 (Blachcro-Hai'ssinsky Mathicu [2], Thm. 1.3). Let G be a non- 
elementary hyperbolic group, and for y e dcoG, let B{y,R) denote the ball of radius 
R in the a-metric (see Definition 2.4-6). Let ^ be a symmetric probability measure 
with finite first moment, and let v be the associated harmonic measure on d^oG. 
Then for u-almost every y G dooG, there is convergence 

lim log u{B{y,R))/ log R = iG/aL 

R — >-0 

where L is the drift in the word metric, and £g is the drift in the Green metric. 

Note that Kingman's subadditive ergodic theorem imphes that the drift £g with 
respect to the Green metric is well-defined, essentially by the same argument as the 
proof of Lemma 4.1.2. 

4.5. Harnack inequality. The classical Harnack inequality relates the values of a 
positive harmonic function at two points. In its infinitesimal version, it asserts an 
upper bound on the logarithmic derivative of a positive harmonic function. 

Let / be a non- negative bounded harmonic function on H", for simplicity. The 
Poisson formula says that f{p) = Jgn-i fdvp where Vp is the visual measure as 
seen from p. If v is visual measure as seen from the origin, and g is any isomctry 
taking the origin to p, then Vp = g^v. To understand how / varies as a function 
of p therefore, it suffices to understand how Vp varies as a function of p. If B is an 
infinitesimal ball centered at some point y in S'^^ , then the visual size of B grows 
like e**^"^^) as one moves distance t in the direction of y. Hence: 

Proposition 4.5.1 (Harnack inequality). Let f be a non-negative bounded har- 
monic function on H". Then the logarithmic derivative of f satisfies the inequality 
\dlogf\ < (n-1). 

If / is a non-negative harmonic function on a group G, the analog of this in- 
equality is f{gs)/f{g) < for any g & G and s & S where D is the dimension of 
v, which can be determined from Theorem 4.4.2. 

If S' is a closed surface of genus > 2 and p : tti{S) G is injective, then 7ri(S') 
acts on G by left translation, and there is an associated foliated bundle with fiber 
the ideal circle 9oo7ri(5) with its natural 7ri(S') action. We can build a harmonic 
connection for this circle bimdle; i.e. a choice of measure rUg on the circle S^{g) 
over each g G G so that for any subset A C we have mg{A) = J2 IJ'{s)mgs{A). 
Since the circle is 1-dimensional, these measures integrate to metrics on the circles 
S^{g) for which the curvature is harmonic. The Harnack inequality then gives a 
priori bounds on this curvature, and one can deduce local compactness results for 
families of injective surface maps of variable genus. For stable minimal surfaces 
in hyperbolic 3-manifolds, such a priori bounds were obtained by Schoen [38] and 
are an important tool in low-dimensional topology. The idea of using Harnack- 
type inequalities to obtain curvature bounds is due to Thurston [44] (also see [6], 
Example 4.6). 

4.6. Monotonicity. A norm on a group is a non-negative function r : G — > M 
so that T{gh) < T{g) -\- T{h) for all g,h G G. A functor from groups to norms is 
monotone if TH{4>{g)) < Tcig) for any g G G and (j) : G ^ H. 

If T is a norm on G, and /i is a probability measure with finite first moment, 
it makes sense to study the growth rate of r under /x-random walk on G. If G 
is finitely generated, one can study the growth rate of r under all simple random 
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walks; if they all have the same growth rate, this rate is an invariant of G. Since fx 
random walk on G pushes forward to random walk on (f){G) = H, the growth 
rate of a monotone family of norms cannot increase under a homomorphism; thus 
if the growth rate of tq on G is strictly smaller than the growth rate of th on H, 
there are strong constraints on the homomorphisms from G to H. 

As an example, consider the commutator length cl. For any group G and any g 
in the commutator subgroup [G, G], the commutator length c\{g) is just the least 
number of commutators in G whose product is g (for technical reasons, one usually 
studies a closely related quantity, namely the stable commutator length; see e.g [7] 
for an introduction). 

One of the main theorems of [9] is as follows: 

Theorem 4.6.1 (Calegari-Maher [9]). Let G be hyperbolic, and let jj. be a non- 
degenerate symmetric probability measure with finite first moment whose support 
generates a nonelementary subgroup. There is a constant G so that if gn is ob- 
tained by random walk of length n, conditioned to lie in [G, G], then 

C-^n/ login) < cl(5„) < Gn/log(n) 

with probability 1 — 0{C~'^ ). 

Said another way, commutator length grows like n/log(n) under random walk 
in a hyperbolic group. Similar estimates on commutator length can bo obtained 
for groups acting in a suitable way on (not necessarily proper) hyperbolic spaces; 
the most important examples are mapping class groups and relatively hyperbolic 
groups. 

As a corollary, if H is any finitely generated group, and commutator length in 
H grows like o(n/ log(n)) for simple random walk (with respect to some generating 
set), then there are no interesting homomorphisms from H to any hyperbolic group 
G, and no interesting actions of H on certain hyperbolic complexes. 
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